A2- Part III

School Scores

Author

Sara S

Dataset: School Scores

Introduction

This dataset includes students’ scores in subjects like Arts, English, Mathematics, and Science, along with details about their family income and gender. It helps to understand how these factors affect students academic performance.

Installing Packages

library(tidyverse)
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.5
✔ forcats   1.0.0     ✔ stringr   1.5.1
✔ ggplot2   3.5.1     ✔ tibble    3.2.1
✔ lubridate 1.9.3     ✔ tidyr     1.3.1
✔ purrr     1.0.2     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
library(mosaic)
Registered S3 method overwritten by 'mosaic':
  method                           from   
  fortify.SpatialPolygonsDataFrame ggplot2

The 'mosaic' package masks several functions from core packages in order to add 
additional features.  The original behavior of these functions should not be affected by this.

Attaching package: 'mosaic'

The following object is masked from 'package:Matrix':

    mean

The following objects are masked from 'package:dplyr':

    count, do, tally

The following object is masked from 'package:purrr':

    cross

The following object is masked from 'package:ggplot2':

    stat

The following objects are masked from 'package:stats':

    binom.test, cor, cor.test, cov, fivenum, IQR, median, prop.test,
    quantile, sd, t.test, var

The following objects are masked from 'package:base':

    max, mean, min, prod, range, sample, sum
library(skimr)

Attaching package: 'skimr'

The following object is masked from 'package:mosaic':

    n_missing
library(ggformula)
library(GGally)
Registered S3 method overwritten by 'GGally':
  method from   
  +.gg   ggplot2
library(janitor)

Attaching package: 'janitor'

The following objects are masked from 'package:stats':

    chisq.test, fisher.test

Importing the dataset

school <- read_csv("../../data/school.csv")
Rows: 577 Columns: 99
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr  (2): State.Code, State.Name
dbl (97): Year, Total.Math, Total.Test-takers, Total.Verbal, Academic Subjec...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
school
# A tibble: 577 × 99
    Year State.Code State.Name       Total.Math `Total.Test-takers` Total.Verbal
   <dbl> <chr>      <chr>                 <dbl>               <dbl>        <dbl>
 1  2005 AL         Alabama                 559                3985          567
 2  2005 AK         Alaska                  519                3996          523
 3  2005 AZ         Arizona                 530               18184          526
 4  2005 AR         Arkansas                552                1600          563
 5  2005 CA         California              522              186552          504
 6  2005 CO         Colorado                560               11990          560
 7  2005 CT         Connecticut             517               34313          517
 8  2005 DE         Delaware                502                6257          503
 9  2005 DC         District Of Col…        478                3622          490
10  2005 FL         Florida                 498               93505          498
# ℹ 567 more rows
# ℹ 93 more variables: `Academic Subjects.Arts/Music.Average GPA` <dbl>,
#   `Academic Subjects.Arts/Music.Average Years` <dbl>,
#   `Academic Subjects.English.Average GPA` <dbl>,
#   `Academic Subjects.English.Average Years` <dbl>,
#   `Academic Subjects.Foreign Languages.Average GPA` <dbl>,
#   `Academic Subjects.Foreign Languages.Average Years` <dbl>, …

Organizing the dataset

glimpse(school)
Rows: 577
Columns: 99
$ Year                                                      <dbl> 2005, 2005, …
$ State.Code                                                <chr> "AL", "AK", …
$ State.Name                                                <chr> "Alabama", "…
$ Total.Math                                                <dbl> 559, 519, 53…
$ `Total.Test-takers`                                       <dbl> 3985, 3996, …
$ Total.Verbal                                              <dbl> 567, 523, 52…
$ `Academic Subjects.Arts/Music.Average GPA`                <dbl> 3.92, 3.76, …
$ `Academic Subjects.Arts/Music.Average Years`              <dbl> 2.2, 1.9, 2.…
$ `Academic Subjects.English.Average GPA`                   <dbl> 3.53, 3.35, …
$ `Academic Subjects.English.Average Years`                 <dbl> 3.9, 3.9, 3.…
$ `Academic Subjects.Foreign Languages.Average GPA`         <dbl> 3.54, 3.34, …
$ `Academic Subjects.Foreign Languages.Average Years`       <dbl> 2.6, 2.1, 2.…
$ `Academic Subjects.Mathematics.Average GPA`               <dbl> 3.41, 3.06, …
$ `Academic Subjects.Mathematics.Average Years`             <dbl> 4.0, 3.5, 3.…
$ `Academic Subjects.Natural Sciences.Average GPA`          <dbl> 3.52, 3.25, …
$ `Academic Subjects.Natural Sciences.Average Years`        <dbl> 3.9, 3.2, 3.…
$ `Academic Subjects.Social Sciences/History.Average GPA`   <dbl> 3.59, 3.39, …
$ `Academic Subjects.Social Sciences/History.Average Years` <dbl> 3.9, 3.4, 3.…
$ `Family Income.Between 20-40k.Math`                       <dbl> 513, 492, 49…
$ `Family Income.Between 20-40k.Test-takers`                <dbl> 324, 401, 21…
$ `Family Income.Between 20-40k.Verbal`                     <dbl> 527, 500, 49…
$ `Family Income.Between 40-60k.Math`                       <dbl> 539, 517, 52…
$ `Family Income.Between 40-60k.Test-takers`                <dbl> 442, 539, 22…
$ `Family Income.Between 40-60k.Verbal`                     <dbl> 551, 522, 51…
$ `Family Income.Between 60-80k.Math`                       <dbl> 550, 513, 52…
$ `Family Income.Between 60-80k.Test-takers`                <dbl> 473, 603, 23…
$ `Family Income.Between 60-80k.Verbal`                     <dbl> 564, 519, 52…
$ `Family Income.Between 80-100k.Math`                      <dbl> 566, 528, 53…
$ `Family Income.Between 80-100k.Test-takers`               <dbl> 475, 444, 18…
$ `Family Income.Between 80-100k.Verbal`                    <dbl> 577, 534, 53…
$ `Family Income.Less than 20k.Math`                        <dbl> 462, 464, 48…
$ `Family Income.Less than 20k.Test-takers`                 <dbl> 175, 191, 89…
$ `Family Income.Less than 20k.Verbal`                      <dbl> 474, 467, 47…
$ `Family Income.More than 100k.Math`                       <dbl> 588, 541, 55…
$ `Family Income.More than 100k.Test-takers`                <dbl> 980, 540, 30…
$ `Family Income.More than 100k.Verbal`                     <dbl> 590, 544, 54…
$ `GPA.A minus.Math`                                        <dbl> 569, 544, 54…
$ `GPA.A minus.Test-takers`                                 <dbl> 724, 673, 33…
$ `GPA.A minus.Verbal`                                      <dbl> 575, 546, 53…
$ `GPA.A plus.Math`                                         <dbl> 622, 600, 60…
$ `GPA.A plus.Test-takers`                                  <dbl> 563, 173, 16…
$ `GPA.A plus.Verbal`                                       <dbl> 623, 604, 59…
$ GPA.A.Math                                                <dbl> 600, 580, 57…
$ `GPA.A.Test-takers`                                       <dbl> 1032, 671, 3…
$ GPA.A.Verbal                                              <dbl> 608, 578, 56…
$ GPA.B.Math                                                <dbl> 514, 492, 49…
$ `GPA.B.Test-takers`                                       <dbl> 1253, 1622, …
$ GPA.B.Verbal                                              <dbl> 525, 499, 49…
$ GPA.C.Math                                                <dbl> 436, 466, 45…
$ `GPA.C.Test-takers`                                       <dbl> 188, 418, 11…
$ GPA.C.Verbal                                              <dbl> 451, 472, 46…
$ `GPA.D or lower.Math`                                     <dbl> 0, 424, 439,…
$ `GPA.D or lower.Test-takers`                              <dbl> 0, 12, 16, 0…
$ `GPA.D or lower.Verbal`                                   <dbl> 0, 466, 435,…
$ `GPA.No response.Math`                                    <dbl> 0, 0, 0, 0, …
$ `GPA.No response.Test-takers`                             <dbl> 225, 427, 91…
$ `GPA.No response.Verbal`                                  <dbl> 0, 0, 0, 0, …
$ Gender.Female.Math                                        <dbl> 538, 505, 51…
$ `Gender.Female.Test-takers`                               <dbl> 2072, 2161, …
$ Gender.Female.Verbal                                      <dbl> 561, 521, 52…
$ Gender.Male.Math                                          <dbl> 582, 535, 54…
$ `Gender.Male.Test-takers`                                 <dbl> 1913, 1835, …
$ Gender.Male.Verbal                                        <dbl> 574, 526, 53…
$ `Score Ranges.Between 200 to 300.Math.Females`            <dbl> 22, 30, 119,…
$ `Score Ranges.Between 200 to 300.Math.Males`              <dbl> 10, 20, 72, …
$ `Score Ranges.Between 200 to 300.Math.Total`              <dbl> 32, 50, 191,…
$ `Score Ranges.Between 200 to 300.Verbal.Females`          <dbl> 14, 26, 115,…
$ `Score Ranges.Between 200 to 300.Verbal.Males`            <dbl> 17, 26, 86, …
$ `Score Ranges.Between 200 to 300.Verbal.Total`            <dbl> 31, 52, 201,…
$ `Score Ranges.Between 300 to 400.Math.Females`            <dbl> 173, 233, 88…
$ `Score Ranges.Between 300 to 400.Math.Males`              <dbl> 93, 153, 450…
$ `Score Ranges.Between 300 to 400.Math.Total`              <dbl> 266, 386, 13…
$ `Score Ranges.Between 300 to 400.Verbal.Females`          <dbl> 123, 218, 73…
$ `Score Ranges.Between 300 to 400.Verbal.Males`            <dbl> 84, 171, 613…
$ `Score Ranges.Between 300 to 400.Verbal.Total`            <dbl> 207, 389, 13…
$ `Score Ranges.Between 400 to 500.Math.Females`            <dbl> 514, 696, 32…
$ `Score Ranges.Between 400 to 500.Math.Males`              <dbl> 293, 485, 19…
$ `Score Ranges.Between 400 to 500.Math.Total`              <dbl> 807, 1181, 5…
$ `Score Ranges.Between 400 to 500.Verbal.Females`          <dbl> 430, 656, 30…
$ `Score Ranges.Between 400 to 500.Verbal.Males`            <dbl> 332, 552, 23…
$ `Score Ranges.Between 400 to 500.Verbal.Total`            <dbl> 762, 1208, 5…
$ `Score Ranges.Between 500 to 600.Math.Females`            <dbl> 722, 813, 35…
$ `Score Ranges.Between 500 to 600.Math.Males`              <dbl> 614, 616, 31…
$ `Score Ranges.Between 500 to 600.Math.Total`              <dbl> 1336, 1429, …
$ `Score Ranges.Between 500 to 600.Verbal.Females`          <dbl> 690, 729, 36…
$ `Score Ranges.Between 500 to 600.Verbal.Males`            <dbl> 617, 596, 31…
$ `Score Ranges.Between 500 to 600.Verbal.Total`            <dbl> 1307, 1325, …
$ `Score Ranges.Between 600 to 700.Math.Females`            <dbl> 485, 342, 16…
$ `Score Ranges.Between 600 to 700.Math.Males`              <dbl> 611, 445, 21…
$ `Score Ranges.Between 600 to 700.Math.Total`              <dbl> 1096, 787, 3…
$ `Score Ranges.Between 600 to 700.Verbal.Females`          <dbl> 596, 423, 18…
$ `Score Ranges.Between 600 to 700.Verbal.Males`            <dbl> 613, 375, 16…
$ `Score Ranges.Between 600 to 700.Verbal.Total`            <dbl> 1209, 798, 3…
$ `Score Ranges.Between 700 to 800.Math.Females`            <dbl> 156, 47, 327…
$ `Score Ranges.Between 700 to 800.Math.Males`              <dbl> 292, 116, 63…
$ `Score Ranges.Between 700 to 800.Math.Total`              <dbl> 448, 163, 95…
$ `Score Ranges.Between 700 to 800.Verbal.Females`          <dbl> 219, 109, 41…
$ `Score Ranges.Between 700 to 800.Verbal.Males`            <dbl> 250, 115, 50…
$ `Score Ranges.Between 700 to 800.Verbal.Total`            <dbl> 469, 224, 91…
inspect(school)

categorical variables:  
        name     class levels   n missing
1 State.Code character     53 577       0
2 State.Name character     53 577       0
                                   distribution
1 AK (1.9%), AL (1.9%), AR (1.9%) ...          
2 Alabama (1.9%), Alaska (1.9%) ...            

quantitative variables:  
                                                      name   class     min
1                                                     Year numeric 2005.00
2                                               Total.Math numeric  383.00
3                                        Total.Test-takers numeric  134.00
4                                             Total.Verbal numeric  401.00
5                 Academic Subjects.Arts/Music.Average GPA numeric    3.43
6               Academic Subjects.Arts/Music.Average Years numeric    1.20
7                    Academic Subjects.English.Average GPA numeric    3.03
8                  Academic Subjects.English.Average Years numeric    3.50
9          Academic Subjects.Foreign Languages.Average GPA numeric    3.03
10       Academic Subjects.Foreign Languages.Average Years numeric    1.80
11               Academic Subjects.Mathematics.Average GPA numeric    2.85
12             Academic Subjects.Mathematics.Average Years numeric    3.20
13          Academic Subjects.Natural Sciences.Average GPA numeric    2.87
14        Academic Subjects.Natural Sciences.Average Years numeric    2.80
15   Academic Subjects.Social Sciences/History.Average GPA numeric    3.05
16 Academic Subjects.Social Sciences/History.Average Years numeric    3.00
17                       Family Income.Between 20-40k.Math numeric    0.00
18                Family Income.Between 20-40k.Test-takers numeric    5.00
19                     Family Income.Between 20-40k.Verbal numeric  387.00
20                       Family Income.Between 40-60k.Math numeric  381.00
21                Family Income.Between 40-60k.Test-takers numeric   10.00
22                     Family Income.Between 40-60k.Verbal numeric  414.00
23                       Family Income.Between 60-80k.Math numeric  249.00
24                Family Income.Between 60-80k.Test-takers numeric    8.00
25                     Family Income.Between 60-80k.Verbal numeric  232.00
26                      Family Income.Between 80-100k.Math numeric  398.00
27               Family Income.Between 80-100k.Test-takers numeric    5.00
28                    Family Income.Between 80-100k.Verbal numeric  433.00
29                        Family Income.Less than 20k.Math numeric    0.00
30                 Family Income.Less than 20k.Test-takers numeric    1.00
31                      Family Income.Less than 20k.Verbal numeric    0.00
32                       Family Income.More than 100k.Math numeric    0.00
33                Family Income.More than 100k.Test-takers numeric    2.00
34                     Family Income.More than 100k.Verbal numeric    0.00
35                                        GPA.A minus.Math numeric    0.00
36                                 GPA.A minus.Test-takers numeric    0.00
37                                      GPA.A minus.Verbal numeric    0.00
38                                         GPA.A plus.Math numeric    0.00
39                                  GPA.A plus.Test-takers numeric    0.00
40                                       GPA.A plus.Verbal numeric    0.00
41                                              GPA.A.Math numeric    0.00
42                                       GPA.A.Test-takers numeric    0.00
43                                            GPA.A.Verbal numeric    0.00
44                                              GPA.B.Math numeric    0.00
45                                       GPA.B.Test-takers numeric    0.00
46                                            GPA.B.Verbal numeric    0.00
47                                              GPA.C.Math numeric    0.00
48                                       GPA.C.Test-takers numeric    0.00
49                                            GPA.C.Verbal numeric    0.00
50                                     GPA.D or lower.Math numeric    0.00
51                              GPA.D or lower.Test-takers numeric    0.00
52                                   GPA.D or lower.Verbal numeric    0.00
53                                    GPA.No response.Math numeric    0.00
54                             GPA.No response.Test-takers numeric    0.00
55                                  GPA.No response.Verbal numeric    0.00
56                                      Gender.Female.Math numeric  368.00
57                               Gender.Female.Test-takers numeric   73.00
58                                    Gender.Female.Verbal numeric  399.00
59                                        Gender.Male.Math numeric  394.00
60                                 Gender.Male.Test-takers numeric   61.00
61                                      Gender.Male.Verbal numeric  403.00
62            Score Ranges.Between 200 to 300.Math.Females numeric    0.00
63              Score Ranges.Between 200 to 300.Math.Males numeric    0.00
64              Score Ranges.Between 200 to 300.Math.Total numeric    0.00
65          Score Ranges.Between 200 to 300.Verbal.Females numeric    0.00
66            Score Ranges.Between 200 to 300.Verbal.Males numeric    0.00
67            Score Ranges.Between 200 to 300.Verbal.Total numeric    0.00
68            Score Ranges.Between 300 to 400.Math.Females numeric    1.00
69              Score Ranges.Between 300 to 400.Math.Males numeric    1.00
70              Score Ranges.Between 300 to 400.Math.Total numeric    0.00
71          Score Ranges.Between 300 to 400.Verbal.Females numeric    1.00
72            Score Ranges.Between 300 to 400.Verbal.Males numeric    1.00
73            Score Ranges.Between 300 to 400.Verbal.Total numeric    2.00
74            Score Ranges.Between 400 to 500.Math.Females numeric    0.00
75              Score Ranges.Between 400 to 500.Math.Males numeric    0.00
76              Score Ranges.Between 400 to 500.Math.Total numeric    0.00
77          Score Ranges.Between 400 to 500.Verbal.Females numeric    1.00
78            Score Ranges.Between 400 to 500.Verbal.Males numeric    2.00
79            Score Ranges.Between 400 to 500.Verbal.Total numeric    0.00
80            Score Ranges.Between 500 to 600.Math.Females numeric    4.00
81              Score Ranges.Between 500 to 600.Math.Males numeric    3.00
82              Score Ranges.Between 500 to 600.Math.Total numeric    6.00
83          Score Ranges.Between 500 to 600.Verbal.Females numeric    4.00
84            Score Ranges.Between 500 to 600.Verbal.Males numeric    4.00
85            Score Ranges.Between 500 to 600.Verbal.Total numeric    1.00
86            Score Ranges.Between 600 to 700.Math.Females numeric   10.00
87              Score Ranges.Between 600 to 700.Math.Males numeric   15.00
88              Score Ranges.Between 600 to 700.Math.Total numeric   26.00
89          Score Ranges.Between 600 to 700.Verbal.Females numeric   13.00
90            Score Ranges.Between 600 to 700.Verbal.Males numeric   10.00
91            Score Ranges.Between 600 to 700.Verbal.Total numeric   23.00
92            Score Ranges.Between 700 to 800.Math.Females numeric    2.00
93              Score Ranges.Between 700 to 800.Math.Males numeric    1.00
94              Score Ranges.Between 700 to 800.Math.Total numeric    1.00
95          Score Ranges.Between 700 to 800.Verbal.Females numeric    2.00
96            Score Ranges.Between 700 to 800.Verbal.Males numeric    2.00
97            Score Ranges.Between 700 to 800.Verbal.Total numeric    4.00
        Q1  median       Q3       max         mean           sd   n missing
1  2007.00 2010.00  2013.00   2015.00  2010.019064 3.169623e+00 577       0
2   504.00  527.00   571.00    619.00   535.682842 4.617161e+01 577       0
3  2536.00 6468.00 35799.00 241553.00 27914.242634 4.560211e+04 577       0
4   496.00  522.00   572.00    612.00   531.334489 4.431830e+01 577       0
5     3.76    3.85     3.90      3.96     3.822704 9.324943e-02 577       0
6     2.10    2.30     2.50      3.10     2.288735 3.191699e-01 577       0
7     3.35    3.51     3.67      3.88     3.500953 1.855612e-01 577       0
8     3.90    3.90     4.00      4.10     3.929463 9.297488e-02 577       0
9     3.30    3.46     3.63      3.79     3.453345 1.891072e-01 577       0
10    2.60    2.80     3.10      3.60     2.850953 3.447069e-01 577       0
11    3.12    3.30     3.51      3.76     3.310312 2.152249e-01 577       0
12    3.80    3.90     4.10      4.40     3.939341 1.682062e-01 577       0
13    3.25    3.42     3.60      3.82     3.418180 1.978311e-01 577       0
14    3.50    3.60     3.80      4.20     3.631889 2.031575e-01 577       0
15    3.38    3.53     3.68      3.88     3.522166 1.775733e-01 577       0
16    3.50    3.60     3.70      4.00     3.618718 1.818971e-01 577       0
17  471.00  495.00   533.00    643.00   500.239168 4.845665e+01 577       0
18  214.00  580.00  3179.00  35446.00  3234.175043 5.935047e+03 577       0
19  466.00  496.00   536.00    634.00   501.327556 4.381485e+01 577       0
20  493.00  519.00   554.00    629.00   522.844021 4.307938e+01 577       0
21  236.00  636.00  3386.00  28124.00  2847.201040 4.638143e+03 577       0
22  489.00  517.00   558.00    628.00   523.083189 4.194743e+01 577       0
23  506.00  531.00   564.00    630.00   533.611785 4.250284e+01 577       0
24  199.00  523.00  2791.00  17937.00  2269.755633 3.535326e+03 577       0
25  501.00  527.00   571.00    645.00   533.845754 4.342085e+01 577       0
26  519.00  539.00   575.00    646.00   547.055459 3.836915e+01 577       0
27  164.00  441.00  2130.00  15358.00  1803.634315 2.855647e+03 577       0
28  514.00  536.00   580.00    651.00   544.582322 3.807276e+01 577       0
29  438.00  465.00   510.00    589.00   461.734835 7.924915e+01 577       0
30  124.00  347.00  2129.00  42551.00  2433.318891 5.254028e+03 577       0
31  429.00  464.00   501.00    579.00   458.433276 6.702153e+01 577       0
32  548.00  565.00   587.00    637.00   565.830156 4.213576e+01 577       0
33  427.00 1000.00  4405.00  46127.00  4141.346620 7.004940e+03 577       0
34  538.00  555.00   590.00    637.00   560.492201 4.258815e+01 577       0
35  532.00  552.00   569.00    619.00   550.032929 3.899533e+01 577       0
36  460.00 1217.00  6372.00  45869.00  4954.838821 8.126753e+03 577       0
37  526.00  547.00   566.00    609.00   543.720971 3.653613e+01 577       0
38  607.00  624.00   644.00    683.00   621.700173 4.297317e+01 577       0
39  274.00  524.00  1792.00  12184.00  1592.202773 2.384094e+03 577       0
40  595.00  616.00   637.00    672.00   613.051993 4.143910e+01 577       0
41  565.00  588.00   605.00    655.00   584.989601 4.178994e+01 577       0
42  680.00 1390.00  6112.00  42656.00  4925.415945 7.645594e+03 577       0
43  556.00  580.00   600.00    637.00   576.849220 3.953681e+01 577       0
44  472.00  492.00   511.00    564.00   490.497400 3.776550e+01 577       0
45  676.00 2282.00 14745.00 104693.00 11728.670711 1.992444e+04 577       0
46  470.00  493.00   517.00    562.00   490.984402 3.658501e+01 577       0
47  413.00  436.00   457.00    553.00   426.221837 7.059629e+01 577       0
48   93.00  445.00  3060.00  22802.00  2619.736568 4.602100e+03 577       0
49  415.00  440.00   464.00    548.00   431.551127 7.161896e+01 577       0
50    0.00  389.00   428.00    648.00   266.636049 2.092454e+02 577       0
51    2.00   12.00    90.00   2061.00    90.762565 2.353662e+02 577       0
52    0.00  394.00   432.00    632.00   272.357019 2.111320e+02 577       0
53    0.00  446.00   496.00    589.00   304.010399 2.363443e+02 577       0
54  107.00  399.00  2206.00  26744.00  1953.188908 3.595058e+03 577       0
55    0.00  462.00   510.00    616.00   315.561525 2.456963e+02 577       0
56  488.00  510.00   551.00    611.00   518.415945 4.421719e+01 577       0
57 1357.00 3428.00 18698.00 133217.00 15011.610052 2.466780e+04 577       0
58  493.00  519.00   569.00    611.00   528.348354 4.347032e+01 577       0
59  521.00  546.00   592.00    640.00   553.911612 4.838629e+01 577       0
60 1177.00 2979.00 16718.00 108336.00 12911.246101 2.095955e+04 577       0
61  499.00  525.00   577.00    635.00   534.883882 4.541287e+01 577       0
62   12.00  102.00   518.00   4294.00   441.296360 7.794464e+02 577       0
63    8.00   60.00   468.00   3034.00   308.422877 5.094988e+02 577       0
64   20.00  162.00   628.00   6772.00   705.206239 1.281787e+03 577       0
65   12.00   46.00   357.00   5111.00   387.736568 7.933545e+02 577       0
66   14.00   83.00   570.00  20348.00   651.103986 1.830084e+03 577       0
67   26.00   74.00   790.00  10603.00   752.656846 1.514439e+03 577       0
68   95.00  599.00  1972.00  24977.00  2180.500867 3.961850e+03 577       0
69   57.00  144.00  1389.00  13740.00  1278.972270 2.396515e+03 577       0
70  149.00  937.00  3288.00  38161.00  3450.223570 6.353179e+03 577       0
71   52.00  206.00  1827.00  22544.00  2017.923744 4.037559e+03 577       0
72   74.00  368.00  2081.00  26188.00  1956.982669 3.578360e+03 577       0
73  110.00  394.00  3530.00  41262.00  3669.703640 7.235214e+03 577       0
74  333.00  670.00  5262.00  43758.00  4597.649913 8.104309e+03 577       0
75  125.00  442.00  3412.00  29254.00  3142.771231 5.642535e+03 577       0
76  493.00 1096.00  8698.00  73012.00  7737.701906 1.374863e+04 577       0
77  198.00  595.00  5082.00  45918.00  4538.656846 8.240606e+03 577       0
78  223.00  892.00  5351.00 164622.00  5540.672444 1.313533e+04 577       0
79  354.00 1063.00  9280.00  80535.00  8190.334489 1.483361e+04 577       0
80  369.00 1006.00  5593.00  35778.00  4332.811092 6.939657e+03 577       0
81  292.00  798.00  5163.00  31702.00  3790.287695 6.212880e+03 577       0
82  651.00 1830.00 10753.00  67480.00  8125.038128 1.313957e+04 577       0
83  356.00  995.00  5835.00  37455.00  4350.618718 6.945720e+03 577       0
84  318.00  871.00  5341.00  31449.00  3760.083189 6.010780e+03 577       0
85  663.00 1888.00 11266.00  68869.00  8112.571924 1.295497e+04 577       0
86  284.00  699.00  3242.00  66431.00  2888.339688 5.908861e+03 577       0
87  328.00  732.00  4178.00  49941.00  3166.878683 5.530240e+03 577       0
88  609.00 1462.00  7308.00 116372.00  6055.634315 1.135802e+04 577       0
89  319.00  718.00  3329.00  71360.00  2914.299827 5.949387e+03 577       0
90  302.00  672.00  3215.00  56513.00  2677.575390 5.129549e+03 577       0
91  617.00 1383.00  6521.00 127873.00  5592.358752 1.106917e+04 577       0
92   83.00  223.00   821.00  24126.00   792.625650 1.787202e+03 577       0
93  163.00  406.00  1475.00  30815.00  1306.644714 2.557591e+03 577       0
94  251.00  645.00  2301.00  54941.00  2099.256499 4.334327e+03 577       0
95  123.00  295.00   987.00  21826.00   849.818024 1.665755e+03 577       0
96  121.00  295.00   970.00  20460.00   847.265165 1.625229e+03 577       0
97  246.00  605.00  1971.00  42286.00  1697.116118 3.289674e+03 577       0
skim(school)
Data summary
Name school
Number of rows 577
Number of columns 99
_______________________
Column type frequency:
character 2
numeric 97
________________________
Group variables None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
State.Code 0 1 2 2 0 53 0
State.Name 0 1 4 20 0 53 0

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
Year 0 1 2010.02 3.17 2005.00 2007.00 2010.00 2013.00 2015.00 ▇▅▅▆▆
Total.Math 0 1 535.68 46.17 383.00 504.00 527.00 571.00 619.00 ▁▁▇▆▅
Total.Test-takers 0 1 27914.24 45602.11 134.00 2536.00 6468.00 35799.00 241553.00 ▇▁▁▁▁
Total.Verbal 0 1 531.33 44.32 401.00 496.00 522.00 572.00 612.00 ▁▂▇▃▅
Academic Subjects.Arts/Music.Average GPA 0 1 3.82 0.09 3.43 3.76 3.85 3.90 3.96 ▁▁▂▆▇
Academic Subjects.Arts/Music.Average Years 0 1 2.29 0.32 1.20 2.10 2.30 2.50 3.10 ▁▂▇▅▂
Academic Subjects.English.Average GPA 0 1 3.50 0.19 3.03 3.35 3.51 3.67 3.88 ▂▆▇▇▃
Academic Subjects.English.Average Years 0 1 3.93 0.09 3.50 3.90 3.90 4.00 4.10 ▁▁▂▇▇
Academic Subjects.Foreign Languages.Average GPA 0 1 3.45 0.19 3.03 3.30 3.46 3.63 3.79 ▃▇▇▇▇
Academic Subjects.Foreign Languages.Average Years 0 1 2.85 0.34 1.80 2.60 2.80 3.10 3.60 ▁▅▆▇▃
Academic Subjects.Mathematics.Average GPA 0 1 3.31 0.22 2.85 3.12 3.30 3.51 3.76 ▂▇▅▇▃
Academic Subjects.Mathematics.Average Years 0 1 3.94 0.17 3.20 3.80 3.90 4.10 4.40 ▁▁▇▆▂
Academic Subjects.Natural Sciences.Average GPA 0 1 3.42 0.20 2.87 3.25 3.42 3.60 3.82 ▁▆▇▇▅
Academic Subjects.Natural Sciences.Average Years 0 1 3.63 0.20 2.80 3.50 3.60 3.80 4.20 ▁▁▇▇▁
Academic Subjects.Social Sciences/History.Average GPA 0 1 3.52 0.18 3.05 3.38 3.53 3.68 3.88 ▁▆▆▇▅
Academic Subjects.Social Sciences/History.Average Years 0 1 3.62 0.18 3.00 3.50 3.60 3.70 4.00 ▁▃▇▇▂
Family Income.Between 20-40k.Math 0 1 500.24 48.46 0.00 471.00 495.00 533.00 643.00 ▁▁▁▇▅
Family Income.Between 20-40k.Test-takers 0 1 3234.18 5935.05 5.00 214.00 580.00 3179.00 35446.00 ▇▁▁▁▁
Family Income.Between 20-40k.Verbal 0 1 501.33 43.81 387.00 466.00 496.00 536.00 634.00 ▁▇▇▅▁
Family Income.Between 40-60k.Math 0 1 522.84 43.08 381.00 493.00 519.00 554.00 629.00 ▁▂▇▆▂
Family Income.Between 40-60k.Test-takers 0 1 2847.20 4638.14 10.00 236.00 636.00 3386.00 28124.00 ▇▁▁▁▁
Family Income.Between 40-60k.Verbal 0 1 523.08 41.95 414.00 489.00 517.00 558.00 628.00 ▁▇▇▇▂
Family Income.Between 60-80k.Math 0 1 533.61 42.50 249.00 506.00 531.00 564.00 630.00 ▁▁▁▇▅
Family Income.Between 60-80k.Test-takers 0 1 2269.76 3535.33 8.00 199.00 523.00 2791.00 17937.00 ▇▁▁▁▁
Family Income.Between 60-80k.Verbal 0 1 533.85 43.42 232.00 501.00 527.00 571.00 645.00 ▁▁▁▇▅
Family Income.Between 80-100k.Math 0 1 547.06 38.37 398.00 519.00 539.00 575.00 646.00 ▁▁▇▅▂
Family Income.Between 80-100k.Test-takers 0 1 1803.63 2855.65 5.00 164.00 441.00 2130.00 15358.00 ▇▁▁▁▁
Family Income.Between 80-100k.Verbal 0 1 544.58 38.07 433.00 514.00 536.00 580.00 651.00 ▁▇▇▇▁
Family Income.Less than 20k.Math 0 1 461.73 79.25 0.00 438.00 465.00 510.00 589.00 ▁▁▁▇▇
Family Income.Less than 20k.Test-takers 0 1 2433.32 5254.03 1.00 124.00 347.00 2129.00 42551.00 ▇▁▁▁▁
Family Income.Less than 20k.Verbal 0 1 458.43 67.02 0.00 429.00 464.00 501.00 579.00 ▁▁▁▇▇
Family Income.More than 100k.Math 0 1 565.83 42.14 0.00 548.00 565.00 587.00 637.00 ▁▁▁▁▇
Family Income.More than 100k.Test-takers 0 1 4141.35 7004.94 2.00 427.00 1000.00 4405.00 46127.00 ▇▁▁▁▁
Family Income.More than 100k.Verbal 0 1 560.49 42.59 0.00 538.00 555.00 590.00 637.00 ▁▁▁▁▇
GPA.A minus.Math 0 1 550.03 39.00 0.00 532.00 552.00 569.00 619.00 ▁▁▁▁▇
GPA.A minus.Test-takers 0 1 4954.84 8126.75 0.00 460.00 1217.00 6372.00 45869.00 ▇▁▁▁▁
GPA.A minus.Verbal 0 1 543.72 36.54 0.00 526.00 547.00 566.00 609.00 ▁▁▁▁▇
GPA.A plus.Math 0 1 621.70 42.97 0.00 607.00 624.00 644.00 683.00 ▁▁▁▁▇
GPA.A plus.Test-takers 0 1 1592.20 2384.09 0.00 274.00 524.00 1792.00 12184.00 ▇▁▁▁▁
GPA.A plus.Verbal 0 1 613.05 41.44 0.00 595.00 616.00 637.00 672.00 ▁▁▁▁▇
GPA.A.Math 0 1 584.99 41.79 0.00 565.00 588.00 605.00 655.00 ▁▁▁▁▇
GPA.A.Test-takers 0 1 4925.42 7645.59 0.00 680.00 1390.00 6112.00 42656.00 ▇▁▁▁▁
GPA.A.Verbal 0 1 576.85 39.54 0.00 556.00 580.00 600.00 637.00 ▁▁▁▁▇
GPA.B.Math 0 1 490.50 37.77 0.00 472.00 492.00 511.00 564.00 ▁▁▁▁▇
GPA.B.Test-takers 0 1 11728.67 19924.44 0.00 676.00 2282.00 14745.00 104693.00 ▇▁▁▁▁
GPA.B.Verbal 0 1 490.98 36.59 0.00 470.00 493.00 517.00 562.00 ▁▁▁▁▇
GPA.C.Math 0 1 426.22 70.60 0.00 413.00 436.00 457.00 553.00 ▁▁▁▇▆
GPA.C.Test-takers 0 1 2619.74 4602.10 0.00 93.00 445.00 3060.00 22802.00 ▇▂▁▁▁
GPA.C.Verbal 0 1 431.55 71.62 0.00 415.00 440.00 464.00 548.00 ▁▁▁▇▇
GPA.D or lower.Math 0 1 266.64 209.25 0.00 0.00 389.00 428.00 648.00 ▆▁▂▇▁
GPA.D or lower.Test-takers 0 1 90.76 235.37 0.00 2.00 12.00 90.00 2061.00 ▇▁▁▁▁
GPA.D or lower.Verbal 0 1 272.36 211.13 0.00 0.00 394.00 432.00 632.00 ▆▁▁▇▁
GPA.No response.Math 0 1 304.01 236.34 0.00 0.00 446.00 496.00 589.00 ▇▁▁▆▇
GPA.No response.Test-takers 0 1 1953.19 3595.06 0.00 107.00 399.00 2206.00 26744.00 ▇▁▁▁▁
GPA.No response.Verbal 0 1 315.56 245.70 0.00 0.00 462.00 510.00 616.00 ▇▁▁▆▇
Gender.Female.Math 0 1 518.42 44.22 368.00 488.00 510.00 551.00 611.00 ▁▁▇▅▃
Gender.Female.Test-takers 0 1 15011.61 24667.80 73.00 1357.00 3428.00 18698.00 133217.00 ▇▁▁▁▁
Gender.Female.Verbal 0 1 528.35 43.47 399.00 493.00 519.00 569.00 611.00 ▁▂▇▃▅
Gender.Male.Math 0 1 553.91 48.39 394.00 521.00 546.00 592.00 640.00 ▁▁▇▆▅
Gender.Male.Test-takers 0 1 12911.25 20959.55 61.00 1177.00 2979.00 16718.00 108336.00 ▇▁▁▁▁
Gender.Male.Verbal 0 1 534.88 45.41 403.00 499.00 525.00 577.00 635.00 ▁▃▇▆▃
Score Ranges.Between 200 to 300.Math.Females 0 1 441.30 779.45 0.00 12.00 102.00 518.00 4294.00 ▇▁▁▁▁
Score Ranges.Between 200 to 300.Math.Males 0 1 308.42 509.50 0.00 8.00 60.00 468.00 3034.00 ▇▁▁▁▁
Score Ranges.Between 200 to 300.Math.Total 0 1 705.21 1281.79 0.00 20.00 162.00 628.00 6772.00 ▇▁▁▁▁
Score Ranges.Between 200 to 300.Verbal.Females 0 1 387.74 793.35 0.00 12.00 46.00 357.00 5111.00 ▇▁▁▁▁
Score Ranges.Between 200 to 300.Verbal.Males 0 1 651.10 1830.08 0.00 14.00 83.00 570.00 20348.00 ▇▁▁▁▁
Score Ranges.Between 200 to 300.Verbal.Total 0 1 752.66 1514.44 0.00 26.00 74.00 790.00 10603.00 ▇▁▁▁▁
Score Ranges.Between 300 to 400.Math.Females 0 1 2180.50 3961.85 1.00 95.00 599.00 1972.00 24977.00 ▇▁▁▁▁
Score Ranges.Between 300 to 400.Math.Males 0 1 1278.97 2396.51 1.00 57.00 144.00 1389.00 13740.00 ▇▁▁▁▁
Score Ranges.Between 300 to 400.Math.Total 0 1 3450.22 6353.18 0.00 149.00 937.00 3288.00 38161.00 ▇▁▁▁▁
Score Ranges.Between 300 to 400.Verbal.Females 0 1 2017.92 4037.56 1.00 52.00 206.00 1827.00 22544.00 ▇▁▁▁▁
Score Ranges.Between 300 to 400.Verbal.Males 0 1 1956.98 3578.36 1.00 74.00 368.00 2081.00 26188.00 ▇▁▁▁▁
Score Ranges.Between 300 to 400.Verbal.Total 0 1 3669.70 7235.21 2.00 110.00 394.00 3530.00 41262.00 ▇▁▁▁▁
Score Ranges.Between 400 to 500.Math.Females 0 1 4597.65 8104.31 0.00 333.00 670.00 5262.00 43758.00 ▇▁▁▁▁
Score Ranges.Between 400 to 500.Math.Males 0 1 3142.77 5642.54 0.00 125.00 442.00 3412.00 29254.00 ▇▁▁▁▁
Score Ranges.Between 400 to 500.Math.Total 0 1 7737.70 13748.63 0.00 493.00 1096.00 8698.00 73012.00 ▇▁▁▁▁
Score Ranges.Between 400 to 500.Verbal.Females 0 1 4538.66 8240.61 1.00 198.00 595.00 5082.00 45918.00 ▇▁▁▁▁
Score Ranges.Between 400 to 500.Verbal.Males 0 1 5540.67 13135.33 2.00 223.00 892.00 5351.00 164622.00 ▇▁▁▁▁
Score Ranges.Between 400 to 500.Verbal.Total 0 1 8190.33 14833.61 0.00 354.00 1063.00 9280.00 80535.00 ▇▁▁▁▁
Score Ranges.Between 500 to 600.Math.Females 0 1 4332.81 6939.66 4.00 369.00 1006.00 5593.00 35778.00 ▇▁▁▁▁
Score Ranges.Between 500 to 600.Math.Males 0 1 3790.29 6212.88 3.00 292.00 798.00 5163.00 31702.00 ▇▁▁▁▁
Score Ranges.Between 500 to 600.Math.Total 0 1 8125.04 13139.57 6.00 651.00 1830.00 10753.00 67480.00 ▇▁▁▁▁
Score Ranges.Between 500 to 600.Verbal.Females 0 1 4350.62 6945.72 4.00 356.00 995.00 5835.00 37455.00 ▇▁▁▁▁
Score Ranges.Between 500 to 600.Verbal.Males 0 1 3760.08 6010.78 4.00 318.00 871.00 5341.00 31449.00 ▇▁▁▁▁
Score Ranges.Between 500 to 600.Verbal.Total 0 1 8112.57 12954.97 1.00 663.00 1888.00 11266.00 68869.00 ▇▁▁▁▁
Score Ranges.Between 600 to 700.Math.Females 0 1 2888.34 5908.86 10.00 284.00 699.00 3242.00 66431.00 ▇▁▁▁▁
Score Ranges.Between 600 to 700.Math.Males 0 1 3166.88 5530.24 15.00 328.00 732.00 4178.00 49941.00 ▇▁▁▁▁
Score Ranges.Between 600 to 700.Math.Total 0 1 6055.63 11358.02 26.00 609.00 1462.00 7308.00 116372.00 ▇▁▁▁▁
Score Ranges.Between 600 to 700.Verbal.Females 0 1 2914.30 5949.39 13.00 319.00 718.00 3329.00 71360.00 ▇▁▁▁▁
Score Ranges.Between 600 to 700.Verbal.Males 0 1 2677.58 5129.55 10.00 302.00 672.00 3215.00 56513.00 ▇▁▁▁▁
Score Ranges.Between 600 to 700.Verbal.Total 0 1 5592.36 11069.17 23.00 617.00 1383.00 6521.00 127873.00 ▇▁▁▁▁
Score Ranges.Between 700 to 800.Math.Females 0 1 792.63 1787.20 2.00 83.00 223.00 821.00 24126.00 ▇▁▁▁▁
Score Ranges.Between 700 to 800.Math.Males 0 1 1306.64 2557.59 1.00 163.00 406.00 1475.00 30815.00 ▇▁▁▁▁
Score Ranges.Between 700 to 800.Math.Total 0 1 2099.26 4334.33 1.00 251.00 645.00 2301.00 54941.00 ▇▁▁▁▁
Score Ranges.Between 700 to 800.Verbal.Females 0 1 849.82 1665.76 2.00 123.00 295.00 987.00 21826.00 ▇▁▁▁▁
Score Ranges.Between 700 to 800.Verbal.Males 0 1 847.27 1625.23 2.00 121.00 295.00 970.00 20460.00 ▇▁▁▁▁
Score Ranges.Between 700 to 800.Verbal.Total 0 1 1697.12 3289.67 4.00 246.00 605.00 1971.00 42286.00 ▇▁▁▁▁

Cleaning the dataset

cleaned <- school %>%
  clean_names(case = "big_camel")
cleaned
# A tibble: 577 × 99
    Year StateCode StateName            TotalMath TotalTestTakers TotalVerbal
   <dbl> <chr>     <chr>                    <dbl>           <dbl>       <dbl>
 1  2005 AL        Alabama                    559            3985         567
 2  2005 AK        Alaska                     519            3996         523
 3  2005 AZ        Arizona                    530           18184         526
 4  2005 AR        Arkansas                   552            1600         563
 5  2005 CA        California                 522          186552         504
 6  2005 CO        Colorado                   560           11990         560
 7  2005 CT        Connecticut                517           34313         517
 8  2005 DE        Delaware                   502            6257         503
 9  2005 DC        District Of Columbia       478            3622         490
10  2005 FL        Florida                    498           93505         498
# ℹ 567 more rows
# ℹ 93 more variables: AcademicSubjectsArtsMusicAverageGpa <dbl>,
#   AcademicSubjectsArtsMusicAverageYears <dbl>,
#   AcademicSubjectsEnglishAverageGpa <dbl>,
#   AcademicSubjectsEnglishAverageYears <dbl>,
#   AcademicSubjectsForeignLanguagesAverageGpa <dbl>,
#   AcademicSubjectsForeignLanguagesAverageYears <dbl>, …
glimpse(cleaned)
Rows: 577
Columns: 99
$ Year                                              <dbl> 2005, 2005, 2005, 20…
$ StateCode                                         <chr> "AL", "AK", "AZ", "A…
$ StateName                                         <chr> "Alabama", "Alaska",…
$ TotalMath                                         <dbl> 559, 519, 530, 552, …
$ TotalTestTakers                                   <dbl> 3985, 3996, 18184, 1…
$ TotalVerbal                                       <dbl> 567, 523, 526, 563, …
$ AcademicSubjectsArtsMusicAverageGpa               <dbl> 3.92, 3.76, 3.85, 3.…
$ AcademicSubjectsArtsMusicAverageYears             <dbl> 2.2, 1.9, 2.1, 2.2, …
$ AcademicSubjectsEnglishAverageGpa                 <dbl> 3.53, 3.35, 3.45, 3.…
$ AcademicSubjectsEnglishAverageYears               <dbl> 3.9, 3.9, 3.9, 4.0, …
$ AcademicSubjectsForeignLanguagesAverageGpa        <dbl> 3.54, 3.34, 3.41, 3.…
$ AcademicSubjectsForeignLanguagesAverageYears      <dbl> 2.6, 2.1, 2.6, 2.6, …
$ AcademicSubjectsMathematicsAverageGpa             <dbl> 3.41, 3.06, 3.25, 3.…
$ AcademicSubjectsMathematicsAverageYears           <dbl> 4.0, 3.5, 3.9, 4.1, …
$ AcademicSubjectsNaturalSciencesAverageGpa         <dbl> 3.52, 3.25, 3.43, 3.…
$ AcademicSubjectsNaturalSciencesAverageYears       <dbl> 3.9, 3.2, 3.4, 3.7, …
$ AcademicSubjectsSocialSciencesHistoryAverageGpa   <dbl> 3.59, 3.39, 3.55, 3.…
$ AcademicSubjectsSocialSciencesHistoryAverageYears <dbl> 3.9, 3.4, 3.3, 3.6, …
$ FamilyIncomeBetween20_40KMath                     <dbl> 513, 492, 498, 513, …
$ FamilyIncomeBetween20_40KTestTakers               <dbl> 324, 401, 2121, 180,…
$ FamilyIncomeBetween20_40KVerbal                   <dbl> 527, 500, 495, 526, …
$ FamilyIncomeBetween40_60KMath                     <dbl> 539, 517, 520, 543, …
$ FamilyIncomeBetween40_60KTestTakers               <dbl> 442, 539, 2270, 245,…
$ FamilyIncomeBetween40_60KVerbal                   <dbl> 551, 522, 518, 555, …
$ FamilyIncomeBetween60_80KMath                     <dbl> 550, 513, 524, 553, …
$ FamilyIncomeBetween60_80KTestTakers               <dbl> 473, 603, 2372, 227,…
$ FamilyIncomeBetween60_80KVerbal                   <dbl> 564, 519, 523, 570, …
$ FamilyIncomeBetween80_100KMath                    <dbl> 566, 528, 534, 570, …
$ FamilyIncomeBetween80_100KTestTakers              <dbl> 475, 444, 1866, 147,…
$ FamilyIncomeBetween80_100KVerbal                  <dbl> 577, 534, 533, 580, …
$ FamilyIncomeLessThan20KMath                       <dbl> 462, 464, 485, 489, …
$ FamilyIncomeLessThan20KTestTakers                 <dbl> 175, 191, 891, 107, …
$ FamilyIncomeLessThan20KVerbal                     <dbl> 474, 467, 474, 486, …
$ FamilyIncomeMoreThan100KMath                      <dbl> 588, 541, 554, 572, …
$ FamilyIncomeMoreThan100KTestTakers                <dbl> 980, 540, 3083, 314,…
$ FamilyIncomeMoreThan100KVerbal                    <dbl> 590, 544, 546, 589, …
$ GpaAMinusMath                                     <dbl> 569, 544, 541, 559, …
$ GpaAMinusTestTakers                               <dbl> 724, 673, 3334, 298,…
$ GpaAMinusVerbal                                   <dbl> 575, 546, 535, 572, …
$ GpaAPlusMath                                      <dbl> 622, 600, 605, 629, …
$ GpaAPlusTestTakers                                <dbl> 563, 173, 1684, 273,…
$ GpaAPlusVerbal                                    <dbl> 623, 604, 593, 639, …
$ GpaAMath                                          <dbl> 600, 580, 571, 579, …
$ GpaATestTakers                                    <dbl> 1032, 671, 3854, 457…
$ GpaAVerbal                                        <dbl> 608, 578, 563, 583, …
$ GpaBMath                                          <dbl> 514, 492, 498, 492, …
$ GpaBTestTakers                                    <dbl> 1253, 1622, 7193, 43…
$ GpaBVerbal                                        <dbl> 525, 499, 499, 511, …
$ GpaCMath                                          <dbl> 436, 466, 458, 419, …
$ GpaCTestTakers                                    <dbl> 188, 418, 1184, 57, …
$ GpaCVerbal                                        <dbl> 451, 472, 464, 436, …
$ GpaDOrLowerMath                                   <dbl> 0, 424, 439, 0, 419,…
$ GpaDOrLowerTestTakers                             <dbl> 0, 12, 16, 0, 240, 1…
$ GpaDOrLowerVerbal                                 <dbl> 0, 466, 435, 0, 408,…
$ GpaNoResponseMath                                 <dbl> 0, 0, 0, 0, 0, 0, 0,…
$ GpaNoResponseTestTakers                           <dbl> 225, 427, 919, 78, 1…
$ GpaNoResponseVerbal                               <dbl> 0, 0, 0, 0, 0, 0, 0,…
$ GenderFemaleMath                                  <dbl> 538, 505, 513, 536, …
$ GenderFemaleTestTakers                            <dbl> 2072, 2161, 9806, 85…
$ GenderFemaleVerbal                                <dbl> 561, 521, 522, 558, …
$ GenderMaleMath                                    <dbl> 582, 535, 549, 570, …
$ GenderMaleTestTakers                              <dbl> 1913, 1835, 8378, 74…
$ GenderMaleVerbal                                  <dbl> 574, 526, 531, 570, …
$ ScoreRangesBetween200To300MathFemales             <dbl> 22, 30, 119, 12, 297…
$ ScoreRangesBetween200To300MathMales               <dbl> 10, 20, 72, 7, 1453,…
$ ScoreRangesBetween200To300MathTotal               <dbl> 32, 50, 191, 19, 443…
$ ScoreRangesBetween200To300VerbalFemales           <dbl> 14, 26, 115, 9, 3382…
$ ScoreRangesBetween200To300VerbalMales             <dbl> 17, 26, 86, 3, 2433,…
$ ScoreRangesBetween200To300VerbalTotal             <dbl> 31, 52, 201, 12, 581…
$ ScoreRangesBetween300To400MathFemales             <dbl> 173, 233, 881, 68, 1…
$ ScoreRangesBetween300To400MathMales               <dbl> 93, 153, 450, 31, 71…
$ ScoreRangesBetween300To400MathTotal               <dbl> 266, 386, 1331, 99, …
$ ScoreRangesBetween300To400VerbalFemales           <dbl> 123, 218, 739, 46, 1…
$ ScoreRangesBetween300To400VerbalMales             <dbl> 84, 171, 613, 42, 10…
$ ScoreRangesBetween300To400VerbalTotal             <dbl> 207, 389, 1352, 88, …
$ ScoreRangesBetween400To500MathFemales             <dbl> 514, 696, 3215, 210,…
$ ScoreRangesBetween400To500MathMales               <dbl> 293, 485, 1948, 137,…
$ ScoreRangesBetween400To500MathTotal               <dbl> 807, 1181, 5163, 347…
$ ScoreRangesBetween400To500VerbalFemales           <dbl> 430, 656, 3048, 183,…
$ ScoreRangesBetween400To500VerbalMales             <dbl> 332, 552, 2398, 141,…
$ ScoreRangesBetween400To500VerbalTotal             <dbl> 762, 1208, 5446, 324…
$ ScoreRangesBetween500To600MathFemales             <dbl> 722, 813, 3576, 316,…
$ ScoreRangesBetween500To600MathMales               <dbl> 614, 616, 3152, 244,…
$ ScoreRangesBetween500To600MathTotal               <dbl> 1336, 1429, 6728, 56…
$ ScoreRangesBetween500To600VerbalFemales           <dbl> 690, 729, 3661, 302,…
$ ScoreRangesBetween500To600VerbalMales             <dbl> 617, 596, 3101, 236,…
$ ScoreRangesBetween500To600VerbalTotal             <dbl> 1307, 1325, 6762, 53…
$ ScoreRangesBetween600To700MathFemales             <dbl> 485, 342, 1688, 204,…
$ ScoreRangesBetween600To700MathMales               <dbl> 611, 445, 2126, 239,…
$ ScoreRangesBetween600To700MathTotal               <dbl> 1096, 787, 3814, 443…
$ ScoreRangesBetween600To700VerbalFemales           <dbl> 596, 423, 1831, 242,…
$ ScoreRangesBetween600To700VerbalMales             <dbl> 613, 375, 1679, 226,…
$ ScoreRangesBetween600To700VerbalTotal             <dbl> 1209, 798, 3510, 468…
$ ScoreRangesBetween700To800MathFemales             <dbl> 156, 47, 327, 49, 54…
$ ScoreRangesBetween700To800MathMales               <dbl> 292, 116, 630, 83, 8…
$ ScoreRangesBetween700To800MathTotal               <dbl> 448, 163, 957, 132, …
$ ScoreRangesBetween700To800VerbalFemales           <dbl> 219, 109, 412, 77, 5…
$ ScoreRangesBetween700To800VerbalMales             <dbl> 250, 115, 501, 93, 4…
$ ScoreRangesBetween700To800VerbalTotal             <dbl> 469, 224, 913, 170, …
inspect(cleaned)

categorical variables:  
       name     class levels   n missing
1 StateCode character     53 577       0
2 StateName character     53 577       0
                                   distribution
1 AK (1.9%), AL (1.9%), AR (1.9%) ...          
2 Alabama (1.9%), Alaska (1.9%) ...            

quantitative variables:  
                                                name   class     min      Q1
1                                               Year numeric 2005.00 2007.00
2                                          TotalMath numeric  383.00  504.00
3                                    TotalTestTakers numeric  134.00 2536.00
4                                        TotalVerbal numeric  401.00  496.00
5                AcademicSubjectsArtsMusicAverageGpa numeric    3.43    3.76
6              AcademicSubjectsArtsMusicAverageYears numeric    1.20    2.10
7                  AcademicSubjectsEnglishAverageGpa numeric    3.03    3.35
8                AcademicSubjectsEnglishAverageYears numeric    3.50    3.90
9         AcademicSubjectsForeignLanguagesAverageGpa numeric    3.03    3.30
10      AcademicSubjectsForeignLanguagesAverageYears numeric    1.80    2.60
11             AcademicSubjectsMathematicsAverageGpa numeric    2.85    3.12
12           AcademicSubjectsMathematicsAverageYears numeric    3.20    3.80
13         AcademicSubjectsNaturalSciencesAverageGpa numeric    2.87    3.25
14       AcademicSubjectsNaturalSciencesAverageYears numeric    2.80    3.50
15   AcademicSubjectsSocialSciencesHistoryAverageGpa numeric    3.05    3.38
16 AcademicSubjectsSocialSciencesHistoryAverageYears numeric    3.00    3.50
17                     FamilyIncomeBetween20_40KMath numeric    0.00  471.00
18               FamilyIncomeBetween20_40KTestTakers numeric    5.00  214.00
19                   FamilyIncomeBetween20_40KVerbal numeric  387.00  466.00
20                     FamilyIncomeBetween40_60KMath numeric  381.00  493.00
21               FamilyIncomeBetween40_60KTestTakers numeric   10.00  236.00
22                   FamilyIncomeBetween40_60KVerbal numeric  414.00  489.00
23                     FamilyIncomeBetween60_80KMath numeric  249.00  506.00
24               FamilyIncomeBetween60_80KTestTakers numeric    8.00  199.00
25                   FamilyIncomeBetween60_80KVerbal numeric  232.00  501.00
26                    FamilyIncomeBetween80_100KMath numeric  398.00  519.00
27              FamilyIncomeBetween80_100KTestTakers numeric    5.00  164.00
28                  FamilyIncomeBetween80_100KVerbal numeric  433.00  514.00
29                       FamilyIncomeLessThan20KMath numeric    0.00  438.00
30                 FamilyIncomeLessThan20KTestTakers numeric    1.00  124.00
31                     FamilyIncomeLessThan20KVerbal numeric    0.00  429.00
32                      FamilyIncomeMoreThan100KMath numeric    0.00  548.00
33                FamilyIncomeMoreThan100KTestTakers numeric    2.00  427.00
34                    FamilyIncomeMoreThan100KVerbal numeric    0.00  538.00
35                                     GpaAMinusMath numeric    0.00  532.00
36                               GpaAMinusTestTakers numeric    0.00  460.00
37                                   GpaAMinusVerbal numeric    0.00  526.00
38                                      GpaAPlusMath numeric    0.00  607.00
39                                GpaAPlusTestTakers numeric    0.00  274.00
40                                    GpaAPlusVerbal numeric    0.00  595.00
41                                          GpaAMath numeric    0.00  565.00
42                                    GpaATestTakers numeric    0.00  680.00
43                                        GpaAVerbal numeric    0.00  556.00
44                                          GpaBMath numeric    0.00  472.00
45                                    GpaBTestTakers numeric    0.00  676.00
46                                        GpaBVerbal numeric    0.00  470.00
47                                          GpaCMath numeric    0.00  413.00
48                                    GpaCTestTakers numeric    0.00   93.00
49                                        GpaCVerbal numeric    0.00  415.00
50                                   GpaDOrLowerMath numeric    0.00    0.00
51                             GpaDOrLowerTestTakers numeric    0.00    2.00
52                                 GpaDOrLowerVerbal numeric    0.00    0.00
53                                 GpaNoResponseMath numeric    0.00    0.00
54                           GpaNoResponseTestTakers numeric    0.00  107.00
55                               GpaNoResponseVerbal numeric    0.00    0.00
56                                  GenderFemaleMath numeric  368.00  488.00
57                            GenderFemaleTestTakers numeric   73.00 1357.00
58                                GenderFemaleVerbal numeric  399.00  493.00
59                                    GenderMaleMath numeric  394.00  521.00
60                              GenderMaleTestTakers numeric   61.00 1177.00
61                                  GenderMaleVerbal numeric  403.00  499.00
62             ScoreRangesBetween200To300MathFemales numeric    0.00   12.00
63               ScoreRangesBetween200To300MathMales numeric    0.00    8.00
64               ScoreRangesBetween200To300MathTotal numeric    0.00   20.00
65           ScoreRangesBetween200To300VerbalFemales numeric    0.00   12.00
66             ScoreRangesBetween200To300VerbalMales numeric    0.00   14.00
67             ScoreRangesBetween200To300VerbalTotal numeric    0.00   26.00
68             ScoreRangesBetween300To400MathFemales numeric    1.00   95.00
69               ScoreRangesBetween300To400MathMales numeric    1.00   57.00
70               ScoreRangesBetween300To400MathTotal numeric    0.00  149.00
71           ScoreRangesBetween300To400VerbalFemales numeric    1.00   52.00
72             ScoreRangesBetween300To400VerbalMales numeric    1.00   74.00
73             ScoreRangesBetween300To400VerbalTotal numeric    2.00  110.00
74             ScoreRangesBetween400To500MathFemales numeric    0.00  333.00
75               ScoreRangesBetween400To500MathMales numeric    0.00  125.00
76               ScoreRangesBetween400To500MathTotal numeric    0.00  493.00
77           ScoreRangesBetween400To500VerbalFemales numeric    1.00  198.00
78             ScoreRangesBetween400To500VerbalMales numeric    2.00  223.00
79             ScoreRangesBetween400To500VerbalTotal numeric    0.00  354.00
80             ScoreRangesBetween500To600MathFemales numeric    4.00  369.00
81               ScoreRangesBetween500To600MathMales numeric    3.00  292.00
82               ScoreRangesBetween500To600MathTotal numeric    6.00  651.00
83           ScoreRangesBetween500To600VerbalFemales numeric    4.00  356.00
84             ScoreRangesBetween500To600VerbalMales numeric    4.00  318.00
85             ScoreRangesBetween500To600VerbalTotal numeric    1.00  663.00
86             ScoreRangesBetween600To700MathFemales numeric   10.00  284.00
87               ScoreRangesBetween600To700MathMales numeric   15.00  328.00
88               ScoreRangesBetween600To700MathTotal numeric   26.00  609.00
89           ScoreRangesBetween600To700VerbalFemales numeric   13.00  319.00
90             ScoreRangesBetween600To700VerbalMales numeric   10.00  302.00
91             ScoreRangesBetween600To700VerbalTotal numeric   23.00  617.00
92             ScoreRangesBetween700To800MathFemales numeric    2.00   83.00
93               ScoreRangesBetween700To800MathMales numeric    1.00  163.00
94               ScoreRangesBetween700To800MathTotal numeric    1.00  251.00
95           ScoreRangesBetween700To800VerbalFemales numeric    2.00  123.00
96             ScoreRangesBetween700To800VerbalMales numeric    2.00  121.00
97             ScoreRangesBetween700To800VerbalTotal numeric    4.00  246.00
    median       Q3       max         mean           sd   n missing
1  2010.00  2013.00   2015.00  2010.019064 3.169623e+00 577       0
2   527.00   571.00    619.00   535.682842 4.617161e+01 577       0
3  6468.00 35799.00 241553.00 27914.242634 4.560211e+04 577       0
4   522.00   572.00    612.00   531.334489 4.431830e+01 577       0
5     3.85     3.90      3.96     3.822704 9.324943e-02 577       0
6     2.30     2.50      3.10     2.288735 3.191699e-01 577       0
7     3.51     3.67      3.88     3.500953 1.855612e-01 577       0
8     3.90     4.00      4.10     3.929463 9.297488e-02 577       0
9     3.46     3.63      3.79     3.453345 1.891072e-01 577       0
10    2.80     3.10      3.60     2.850953 3.447069e-01 577       0
11    3.30     3.51      3.76     3.310312 2.152249e-01 577       0
12    3.90     4.10      4.40     3.939341 1.682062e-01 577       0
13    3.42     3.60      3.82     3.418180 1.978311e-01 577       0
14    3.60     3.80      4.20     3.631889 2.031575e-01 577       0
15    3.53     3.68      3.88     3.522166 1.775733e-01 577       0
16    3.60     3.70      4.00     3.618718 1.818971e-01 577       0
17  495.00   533.00    643.00   500.239168 4.845665e+01 577       0
18  580.00  3179.00  35446.00  3234.175043 5.935047e+03 577       0
19  496.00   536.00    634.00   501.327556 4.381485e+01 577       0
20  519.00   554.00    629.00   522.844021 4.307938e+01 577       0
21  636.00  3386.00  28124.00  2847.201040 4.638143e+03 577       0
22  517.00   558.00    628.00   523.083189 4.194743e+01 577       0
23  531.00   564.00    630.00   533.611785 4.250284e+01 577       0
24  523.00  2791.00  17937.00  2269.755633 3.535326e+03 577       0
25  527.00   571.00    645.00   533.845754 4.342085e+01 577       0
26  539.00   575.00    646.00   547.055459 3.836915e+01 577       0
27  441.00  2130.00  15358.00  1803.634315 2.855647e+03 577       0
28  536.00   580.00    651.00   544.582322 3.807276e+01 577       0
29  465.00   510.00    589.00   461.734835 7.924915e+01 577       0
30  347.00  2129.00  42551.00  2433.318891 5.254028e+03 577       0
31  464.00   501.00    579.00   458.433276 6.702153e+01 577       0
32  565.00   587.00    637.00   565.830156 4.213576e+01 577       0
33 1000.00  4405.00  46127.00  4141.346620 7.004940e+03 577       0
34  555.00   590.00    637.00   560.492201 4.258815e+01 577       0
35  552.00   569.00    619.00   550.032929 3.899533e+01 577       0
36 1217.00  6372.00  45869.00  4954.838821 8.126753e+03 577       0
37  547.00   566.00    609.00   543.720971 3.653613e+01 577       0
38  624.00   644.00    683.00   621.700173 4.297317e+01 577       0
39  524.00  1792.00  12184.00  1592.202773 2.384094e+03 577       0
40  616.00   637.00    672.00   613.051993 4.143910e+01 577       0
41  588.00   605.00    655.00   584.989601 4.178994e+01 577       0
42 1390.00  6112.00  42656.00  4925.415945 7.645594e+03 577       0
43  580.00   600.00    637.00   576.849220 3.953681e+01 577       0
44  492.00   511.00    564.00   490.497400 3.776550e+01 577       0
45 2282.00 14745.00 104693.00 11728.670711 1.992444e+04 577       0
46  493.00   517.00    562.00   490.984402 3.658501e+01 577       0
47  436.00   457.00    553.00   426.221837 7.059629e+01 577       0
48  445.00  3060.00  22802.00  2619.736568 4.602100e+03 577       0
49  440.00   464.00    548.00   431.551127 7.161896e+01 577       0
50  389.00   428.00    648.00   266.636049 2.092454e+02 577       0
51   12.00    90.00   2061.00    90.762565 2.353662e+02 577       0
52  394.00   432.00    632.00   272.357019 2.111320e+02 577       0
53  446.00   496.00    589.00   304.010399 2.363443e+02 577       0
54  399.00  2206.00  26744.00  1953.188908 3.595058e+03 577       0
55  462.00   510.00    616.00   315.561525 2.456963e+02 577       0
56  510.00   551.00    611.00   518.415945 4.421719e+01 577       0
57 3428.00 18698.00 133217.00 15011.610052 2.466780e+04 577       0
58  519.00   569.00    611.00   528.348354 4.347032e+01 577       0
59  546.00   592.00    640.00   553.911612 4.838629e+01 577       0
60 2979.00 16718.00 108336.00 12911.246101 2.095955e+04 577       0
61  525.00   577.00    635.00   534.883882 4.541287e+01 577       0
62  102.00   518.00   4294.00   441.296360 7.794464e+02 577       0
63   60.00   468.00   3034.00   308.422877 5.094988e+02 577       0
64  162.00   628.00   6772.00   705.206239 1.281787e+03 577       0
65   46.00   357.00   5111.00   387.736568 7.933545e+02 577       0
66   83.00   570.00  20348.00   651.103986 1.830084e+03 577       0
67   74.00   790.00  10603.00   752.656846 1.514439e+03 577       0
68  599.00  1972.00  24977.00  2180.500867 3.961850e+03 577       0
69  144.00  1389.00  13740.00  1278.972270 2.396515e+03 577       0
70  937.00  3288.00  38161.00  3450.223570 6.353179e+03 577       0
71  206.00  1827.00  22544.00  2017.923744 4.037559e+03 577       0
72  368.00  2081.00  26188.00  1956.982669 3.578360e+03 577       0
73  394.00  3530.00  41262.00  3669.703640 7.235214e+03 577       0
74  670.00  5262.00  43758.00  4597.649913 8.104309e+03 577       0
75  442.00  3412.00  29254.00  3142.771231 5.642535e+03 577       0
76 1096.00  8698.00  73012.00  7737.701906 1.374863e+04 577       0
77  595.00  5082.00  45918.00  4538.656846 8.240606e+03 577       0
78  892.00  5351.00 164622.00  5540.672444 1.313533e+04 577       0
79 1063.00  9280.00  80535.00  8190.334489 1.483361e+04 577       0
80 1006.00  5593.00  35778.00  4332.811092 6.939657e+03 577       0
81  798.00  5163.00  31702.00  3790.287695 6.212880e+03 577       0
82 1830.00 10753.00  67480.00  8125.038128 1.313957e+04 577       0
83  995.00  5835.00  37455.00  4350.618718 6.945720e+03 577       0
84  871.00  5341.00  31449.00  3760.083189 6.010780e+03 577       0
85 1888.00 11266.00  68869.00  8112.571924 1.295497e+04 577       0
86  699.00  3242.00  66431.00  2888.339688 5.908861e+03 577       0
87  732.00  4178.00  49941.00  3166.878683 5.530240e+03 577       0
88 1462.00  7308.00 116372.00  6055.634315 1.135802e+04 577       0
89  718.00  3329.00  71360.00  2914.299827 5.949387e+03 577       0
90  672.00  3215.00  56513.00  2677.575390 5.129549e+03 577       0
91 1383.00  6521.00 127873.00  5592.358752 1.106917e+04 577       0
92  223.00   821.00  24126.00   792.625650 1.787202e+03 577       0
93  406.00  1475.00  30815.00  1306.644714 2.557591e+03 577       0
94  645.00  2301.00  54941.00  2099.256499 4.334327e+03 577       0
95  295.00   987.00  21826.00   849.818024 1.665755e+03 577       0
96  295.00   970.00  20460.00   847.265165 1.625229e+03 577       0
97  605.00  1971.00  42286.00  1697.116118 3.289674e+03 577       0
skim(cleaned)
Data summary
Name cleaned
Number of rows 577
Number of columns 99
_______________________
Column type frequency:
character 2
numeric 97
________________________
Group variables None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
StateCode 0 1 2 2 0 53 0
StateName 0 1 4 20 0 53 0

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
Year 0 1 2010.02 3.17 2005.00 2007.00 2010.00 2013.00 2015.00 ▇▅▅▆▆
TotalMath 0 1 535.68 46.17 383.00 504.00 527.00 571.00 619.00 ▁▁▇▆▅
TotalTestTakers 0 1 27914.24 45602.11 134.00 2536.00 6468.00 35799.00 241553.00 ▇▁▁▁▁
TotalVerbal 0 1 531.33 44.32 401.00 496.00 522.00 572.00 612.00 ▁▂▇▃▅
AcademicSubjectsArtsMusicAverageGpa 0 1 3.82 0.09 3.43 3.76 3.85 3.90 3.96 ▁▁▂▆▇
AcademicSubjectsArtsMusicAverageYears 0 1 2.29 0.32 1.20 2.10 2.30 2.50 3.10 ▁▂▇▅▂
AcademicSubjectsEnglishAverageGpa 0 1 3.50 0.19 3.03 3.35 3.51 3.67 3.88 ▂▆▇▇▃
AcademicSubjectsEnglishAverageYears 0 1 3.93 0.09 3.50 3.90 3.90 4.00 4.10 ▁▁▂▇▇
AcademicSubjectsForeignLanguagesAverageGpa 0 1 3.45 0.19 3.03 3.30 3.46 3.63 3.79 ▃▇▇▇▇
AcademicSubjectsForeignLanguagesAverageYears 0 1 2.85 0.34 1.80 2.60 2.80 3.10 3.60 ▁▅▆▇▃
AcademicSubjectsMathematicsAverageGpa 0 1 3.31 0.22 2.85 3.12 3.30 3.51 3.76 ▂▇▅▇▃
AcademicSubjectsMathematicsAverageYears 0 1 3.94 0.17 3.20 3.80 3.90 4.10 4.40 ▁▁▇▆▂
AcademicSubjectsNaturalSciencesAverageGpa 0 1 3.42 0.20 2.87 3.25 3.42 3.60 3.82 ▁▆▇▇▅
AcademicSubjectsNaturalSciencesAverageYears 0 1 3.63 0.20 2.80 3.50 3.60 3.80 4.20 ▁▁▇▇▁
AcademicSubjectsSocialSciencesHistoryAverageGpa 0 1 3.52 0.18 3.05 3.38 3.53 3.68 3.88 ▁▆▆▇▅
AcademicSubjectsSocialSciencesHistoryAverageYears 0 1 3.62 0.18 3.00 3.50 3.60 3.70 4.00 ▁▃▇▇▂
FamilyIncomeBetween20_40KMath 0 1 500.24 48.46 0.00 471.00 495.00 533.00 643.00 ▁▁▁▇▅
FamilyIncomeBetween20_40KTestTakers 0 1 3234.18 5935.05 5.00 214.00 580.00 3179.00 35446.00 ▇▁▁▁▁
FamilyIncomeBetween20_40KVerbal 0 1 501.33 43.81 387.00 466.00 496.00 536.00 634.00 ▁▇▇▅▁
FamilyIncomeBetween40_60KMath 0 1 522.84 43.08 381.00 493.00 519.00 554.00 629.00 ▁▂▇▆▂
FamilyIncomeBetween40_60KTestTakers 0 1 2847.20 4638.14 10.00 236.00 636.00 3386.00 28124.00 ▇▁▁▁▁
FamilyIncomeBetween40_60KVerbal 0 1 523.08 41.95 414.00 489.00 517.00 558.00 628.00 ▁▇▇▇▂
FamilyIncomeBetween60_80KMath 0 1 533.61 42.50 249.00 506.00 531.00 564.00 630.00 ▁▁▁▇▅
FamilyIncomeBetween60_80KTestTakers 0 1 2269.76 3535.33 8.00 199.00 523.00 2791.00 17937.00 ▇▁▁▁▁
FamilyIncomeBetween60_80KVerbal 0 1 533.85 43.42 232.00 501.00 527.00 571.00 645.00 ▁▁▁▇▅
FamilyIncomeBetween80_100KMath 0 1 547.06 38.37 398.00 519.00 539.00 575.00 646.00 ▁▁▇▅▂
FamilyIncomeBetween80_100KTestTakers 0 1 1803.63 2855.65 5.00 164.00 441.00 2130.00 15358.00 ▇▁▁▁▁
FamilyIncomeBetween80_100KVerbal 0 1 544.58 38.07 433.00 514.00 536.00 580.00 651.00 ▁▇▇▇▁
FamilyIncomeLessThan20KMath 0 1 461.73 79.25 0.00 438.00 465.00 510.00 589.00 ▁▁▁▇▇
FamilyIncomeLessThan20KTestTakers 0 1 2433.32 5254.03 1.00 124.00 347.00 2129.00 42551.00 ▇▁▁▁▁
FamilyIncomeLessThan20KVerbal 0 1 458.43 67.02 0.00 429.00 464.00 501.00 579.00 ▁▁▁▇▇
FamilyIncomeMoreThan100KMath 0 1 565.83 42.14 0.00 548.00 565.00 587.00 637.00 ▁▁▁▁▇
FamilyIncomeMoreThan100KTestTakers 0 1 4141.35 7004.94 2.00 427.00 1000.00 4405.00 46127.00 ▇▁▁▁▁
FamilyIncomeMoreThan100KVerbal 0 1 560.49 42.59 0.00 538.00 555.00 590.00 637.00 ▁▁▁▁▇
GpaAMinusMath 0 1 550.03 39.00 0.00 532.00 552.00 569.00 619.00 ▁▁▁▁▇
GpaAMinusTestTakers 0 1 4954.84 8126.75 0.00 460.00 1217.00 6372.00 45869.00 ▇▁▁▁▁
GpaAMinusVerbal 0 1 543.72 36.54 0.00 526.00 547.00 566.00 609.00 ▁▁▁▁▇
GpaAPlusMath 0 1 621.70 42.97 0.00 607.00 624.00 644.00 683.00 ▁▁▁▁▇
GpaAPlusTestTakers 0 1 1592.20 2384.09 0.00 274.00 524.00 1792.00 12184.00 ▇▁▁▁▁
GpaAPlusVerbal 0 1 613.05 41.44 0.00 595.00 616.00 637.00 672.00 ▁▁▁▁▇
GpaAMath 0 1 584.99 41.79 0.00 565.00 588.00 605.00 655.00 ▁▁▁▁▇
GpaATestTakers 0 1 4925.42 7645.59 0.00 680.00 1390.00 6112.00 42656.00 ▇▁▁▁▁
GpaAVerbal 0 1 576.85 39.54 0.00 556.00 580.00 600.00 637.00 ▁▁▁▁▇
GpaBMath 0 1 490.50 37.77 0.00 472.00 492.00 511.00 564.00 ▁▁▁▁▇
GpaBTestTakers 0 1 11728.67 19924.44 0.00 676.00 2282.00 14745.00 104693.00 ▇▁▁▁▁
GpaBVerbal 0 1 490.98 36.59 0.00 470.00 493.00 517.00 562.00 ▁▁▁▁▇
GpaCMath 0 1 426.22 70.60 0.00 413.00 436.00 457.00 553.00 ▁▁▁▇▆
GpaCTestTakers 0 1 2619.74 4602.10 0.00 93.00 445.00 3060.00 22802.00 ▇▂▁▁▁
GpaCVerbal 0 1 431.55 71.62 0.00 415.00 440.00 464.00 548.00 ▁▁▁▇▇
GpaDOrLowerMath 0 1 266.64 209.25 0.00 0.00 389.00 428.00 648.00 ▆▁▂▇▁
GpaDOrLowerTestTakers 0 1 90.76 235.37 0.00 2.00 12.00 90.00 2061.00 ▇▁▁▁▁
GpaDOrLowerVerbal 0 1 272.36 211.13 0.00 0.00 394.00 432.00 632.00 ▆▁▁▇▁
GpaNoResponseMath 0 1 304.01 236.34 0.00 0.00 446.00 496.00 589.00 ▇▁▁▆▇
GpaNoResponseTestTakers 0 1 1953.19 3595.06 0.00 107.00 399.00 2206.00 26744.00 ▇▁▁▁▁
GpaNoResponseVerbal 0 1 315.56 245.70 0.00 0.00 462.00 510.00 616.00 ▇▁▁▆▇
GenderFemaleMath 0 1 518.42 44.22 368.00 488.00 510.00 551.00 611.00 ▁▁▇▅▃
GenderFemaleTestTakers 0 1 15011.61 24667.80 73.00 1357.00 3428.00 18698.00 133217.00 ▇▁▁▁▁
GenderFemaleVerbal 0 1 528.35 43.47 399.00 493.00 519.00 569.00 611.00 ▁▂▇▃▅
GenderMaleMath 0 1 553.91 48.39 394.00 521.00 546.00 592.00 640.00 ▁▁▇▆▅
GenderMaleTestTakers 0 1 12911.25 20959.55 61.00 1177.00 2979.00 16718.00 108336.00 ▇▁▁▁▁
GenderMaleVerbal 0 1 534.88 45.41 403.00 499.00 525.00 577.00 635.00 ▁▃▇▆▃
ScoreRangesBetween200To300MathFemales 0 1 441.30 779.45 0.00 12.00 102.00 518.00 4294.00 ▇▁▁▁▁
ScoreRangesBetween200To300MathMales 0 1 308.42 509.50 0.00 8.00 60.00 468.00 3034.00 ▇▁▁▁▁
ScoreRangesBetween200To300MathTotal 0 1 705.21 1281.79 0.00 20.00 162.00 628.00 6772.00 ▇▁▁▁▁
ScoreRangesBetween200To300VerbalFemales 0 1 387.74 793.35 0.00 12.00 46.00 357.00 5111.00 ▇▁▁▁▁
ScoreRangesBetween200To300VerbalMales 0 1 651.10 1830.08 0.00 14.00 83.00 570.00 20348.00 ▇▁▁▁▁
ScoreRangesBetween200To300VerbalTotal 0 1 752.66 1514.44 0.00 26.00 74.00 790.00 10603.00 ▇▁▁▁▁
ScoreRangesBetween300To400MathFemales 0 1 2180.50 3961.85 1.00 95.00 599.00 1972.00 24977.00 ▇▁▁▁▁
ScoreRangesBetween300To400MathMales 0 1 1278.97 2396.51 1.00 57.00 144.00 1389.00 13740.00 ▇▁▁▁▁
ScoreRangesBetween300To400MathTotal 0 1 3450.22 6353.18 0.00 149.00 937.00 3288.00 38161.00 ▇▁▁▁▁
ScoreRangesBetween300To400VerbalFemales 0 1 2017.92 4037.56 1.00 52.00 206.00 1827.00 22544.00 ▇▁▁▁▁
ScoreRangesBetween300To400VerbalMales 0 1 1956.98 3578.36 1.00 74.00 368.00 2081.00 26188.00 ▇▁▁▁▁
ScoreRangesBetween300To400VerbalTotal 0 1 3669.70 7235.21 2.00 110.00 394.00 3530.00 41262.00 ▇▁▁▁▁
ScoreRangesBetween400To500MathFemales 0 1 4597.65 8104.31 0.00 333.00 670.00 5262.00 43758.00 ▇▁▁▁▁
ScoreRangesBetween400To500MathMales 0 1 3142.77 5642.54 0.00 125.00 442.00 3412.00 29254.00 ▇▁▁▁▁
ScoreRangesBetween400To500MathTotal 0 1 7737.70 13748.63 0.00 493.00 1096.00 8698.00 73012.00 ▇▁▁▁▁
ScoreRangesBetween400To500VerbalFemales 0 1 4538.66 8240.61 1.00 198.00 595.00 5082.00 45918.00 ▇▁▁▁▁
ScoreRangesBetween400To500VerbalMales 0 1 5540.67 13135.33 2.00 223.00 892.00 5351.00 164622.00 ▇▁▁▁▁
ScoreRangesBetween400To500VerbalTotal 0 1 8190.33 14833.61 0.00 354.00 1063.00 9280.00 80535.00 ▇▁▁▁▁
ScoreRangesBetween500To600MathFemales 0 1 4332.81 6939.66 4.00 369.00 1006.00 5593.00 35778.00 ▇▁▁▁▁
ScoreRangesBetween500To600MathMales 0 1 3790.29 6212.88 3.00 292.00 798.00 5163.00 31702.00 ▇▁▁▁▁
ScoreRangesBetween500To600MathTotal 0 1 8125.04 13139.57 6.00 651.00 1830.00 10753.00 67480.00 ▇▁▁▁▁
ScoreRangesBetween500To600VerbalFemales 0 1 4350.62 6945.72 4.00 356.00 995.00 5835.00 37455.00 ▇▁▁▁▁
ScoreRangesBetween500To600VerbalMales 0 1 3760.08 6010.78 4.00 318.00 871.00 5341.00 31449.00 ▇▁▁▁▁
ScoreRangesBetween500To600VerbalTotal 0 1 8112.57 12954.97 1.00 663.00 1888.00 11266.00 68869.00 ▇▁▁▁▁
ScoreRangesBetween600To700MathFemales 0 1 2888.34 5908.86 10.00 284.00 699.00 3242.00 66431.00 ▇▁▁▁▁
ScoreRangesBetween600To700MathMales 0 1 3166.88 5530.24 15.00 328.00 732.00 4178.00 49941.00 ▇▁▁▁▁
ScoreRangesBetween600To700MathTotal 0 1 6055.63 11358.02 26.00 609.00 1462.00 7308.00 116372.00 ▇▁▁▁▁
ScoreRangesBetween600To700VerbalFemales 0 1 2914.30 5949.39 13.00 319.00 718.00 3329.00 71360.00 ▇▁▁▁▁
ScoreRangesBetween600To700VerbalMales 0 1 2677.58 5129.55 10.00 302.00 672.00 3215.00 56513.00 ▇▁▁▁▁
ScoreRangesBetween600To700VerbalTotal 0 1 5592.36 11069.17 23.00 617.00 1383.00 6521.00 127873.00 ▇▁▁▁▁
ScoreRangesBetween700To800MathFemales 0 1 792.63 1787.20 2.00 83.00 223.00 821.00 24126.00 ▇▁▁▁▁
ScoreRangesBetween700To800MathMales 0 1 1306.64 2557.59 1.00 163.00 406.00 1475.00 30815.00 ▇▁▁▁▁
ScoreRangesBetween700To800MathTotal 0 1 2099.26 4334.33 1.00 251.00 645.00 2301.00 54941.00 ▇▁▁▁▁
ScoreRangesBetween700To800VerbalFemales 0 1 849.82 1665.76 2.00 123.00 295.00 987.00 21826.00 ▇▁▁▁▁
ScoreRangesBetween700To800VerbalMales 0 1 847.27 1625.23 2.00 121.00 295.00 970.00 20460.00 ▇▁▁▁▁
ScoreRangesBetween700To800VerbalTotal 0 1 1697.12 3289.67 4.00 246.00 605.00 1971.00 42286.00 ▇▁▁▁▁

Data Dictionary

Quantitative Variables

  • TotalMath <dbl>: This variable represents the total math scores of students.

  • TotalVerbal <dbl>: This variable represents the total verbal scores of students.

  • TotalTestTakers <dbl>: This variable indicates the number of test takers.

  • AcademicSubjectsMathematicsAverageGpa <dbl>: This variable represents the average GPA for mathematics.

  • AcademicSubjectsEnglishAverageGpa <dbl>: This variable represents the average GPA for English.

  • AcademicSubjectsNaturalSciencesAverageGpa <dbl>: This variable represents the average GPA for natural sciences.

  • AcademicSubjectsArtsMusicAverageGpa <dbl>: This variable represents the average GPA for arts and music subjects.

  • AcademicSubjectsForeignLanguagesAverageGpa <dbl>: This variable represents the average GPA of students in foreign language subjects.

  • AcademicSubjectsSocialSciencesHistoryAverageGpa <dbl>: This variable indicates the average GPA of students in social sciences and history subjects.

  • FamilyIncomeLessThan20KMath <dbl>: Math scores for students from families with income below $20,000.

  • FamilyIncomeBetween20_40KMath <dbl>: Math scores for students from families with income between $20,000 and $40,000.

  • FamilyIncomeBetween40_60KMath <dbl>: Math scores for families with income between $40,000 and $60,000.

  • FamilyIncomeBetween60_80KMath <dbl>: Math scores for families with income between $60,000 and $80,000.

  • FamilyIncomeBetween80_100KMath <dbl>: Math scores for families with income between $80,000 and $100,000.

  • FamilyIncomeMoreThan100KMath <dbl>: Math scores for students from families with income above $100,000.

  • GpaAPlusMath <dbl>: Math scores for students with an A-plus GPA.

  • GpaAMath <dbl>: Math scores for students with an A GPA.

  • GpaBMath <dbl>: Math scores for students with a B GPA.

  • GenderFemaleMath <dbl>: Math scores for female students.

  • GenderMaleMath <dbl>: Math scores for male students.

  • GenderFemaleVerbal <dbl>: Verbal scores for female students.

  • GenderMaleVerbal <dbl>: Verbal scores for male student.

  • GpaAPlusVerbal <dbl>: Verbal scores for students with an A-plus GPA.

  • GpaAVerbal <dbl>: Verbal scores for students with an A GPA.

  • GpaBVerbal <dbl>: Verbal scores for students with a B GPA.

  • GpaDOrLowerMath <dbl>: Math scores for students with a D or lower GPA.

  • GpaDOrLowerVerbal <dbl>: Verbal scores for students with a D or lower GPA.

  • ScoreRangesBetween300To400VerbalTotal <dbl>: Total number of students scoring between 300 and 400 in verbal.

  • ScoreRangesBetween400To500VerbalTotal <dbl>: Total number of students scoring between 400 and 500 in verbal.

  • ScoreRangesBetween300To400MathTotal <dbl>: Total number of students scoring between 300 and 400 in math.

  • ScoreRangesBetween400To500MathTotal <dbl>: Total number of students scoring between 400 and 500 in math.

Qualitative Variables

  • StateCode <chr>: This variable contains abbreviations of state names such as “AL” for Alabama and “AK” for Alaska.

  • StateName <chr>: This variable contains the full names of states like “Alabama” and “Alaska”.

  • Year <chr>: Although it is usually considered as a numerical value, the Year variable is treated as categorical since it represents distinct time points rather than continuous data.

Factorizing

school_modified <- cleaned %>%
  dplyr::mutate(
    StateCode = as.factor(StateCode),
    StateName = as.factor(StateName),
    Year = as.factor(Year)
  )
glimpse(school_modified)
Rows: 577
Columns: 99
$ Year                                              <fct> 2005, 2005, 2005, 20…
$ StateCode                                         <fct> AL, AK, AZ, AR, CA, …
$ StateName                                         <fct> Alabama, Alaska, Ari…
$ TotalMath                                         <dbl> 559, 519, 530, 552, …
$ TotalTestTakers                                   <dbl> 3985, 3996, 18184, 1…
$ TotalVerbal                                       <dbl> 567, 523, 526, 563, …
$ AcademicSubjectsArtsMusicAverageGpa               <dbl> 3.92, 3.76, 3.85, 3.…
$ AcademicSubjectsArtsMusicAverageYears             <dbl> 2.2, 1.9, 2.1, 2.2, …
$ AcademicSubjectsEnglishAverageGpa                 <dbl> 3.53, 3.35, 3.45, 3.…
$ AcademicSubjectsEnglishAverageYears               <dbl> 3.9, 3.9, 3.9, 4.0, …
$ AcademicSubjectsForeignLanguagesAverageGpa        <dbl> 3.54, 3.34, 3.41, 3.…
$ AcademicSubjectsForeignLanguagesAverageYears      <dbl> 2.6, 2.1, 2.6, 2.6, …
$ AcademicSubjectsMathematicsAverageGpa             <dbl> 3.41, 3.06, 3.25, 3.…
$ AcademicSubjectsMathematicsAverageYears           <dbl> 4.0, 3.5, 3.9, 4.1, …
$ AcademicSubjectsNaturalSciencesAverageGpa         <dbl> 3.52, 3.25, 3.43, 3.…
$ AcademicSubjectsNaturalSciencesAverageYears       <dbl> 3.9, 3.2, 3.4, 3.7, …
$ AcademicSubjectsSocialSciencesHistoryAverageGpa   <dbl> 3.59, 3.39, 3.55, 3.…
$ AcademicSubjectsSocialSciencesHistoryAverageYears <dbl> 3.9, 3.4, 3.3, 3.6, …
$ FamilyIncomeBetween20_40KMath                     <dbl> 513, 492, 498, 513, …
$ FamilyIncomeBetween20_40KTestTakers               <dbl> 324, 401, 2121, 180,…
$ FamilyIncomeBetween20_40KVerbal                   <dbl> 527, 500, 495, 526, …
$ FamilyIncomeBetween40_60KMath                     <dbl> 539, 517, 520, 543, …
$ FamilyIncomeBetween40_60KTestTakers               <dbl> 442, 539, 2270, 245,…
$ FamilyIncomeBetween40_60KVerbal                   <dbl> 551, 522, 518, 555, …
$ FamilyIncomeBetween60_80KMath                     <dbl> 550, 513, 524, 553, …
$ FamilyIncomeBetween60_80KTestTakers               <dbl> 473, 603, 2372, 227,…
$ FamilyIncomeBetween60_80KVerbal                   <dbl> 564, 519, 523, 570, …
$ FamilyIncomeBetween80_100KMath                    <dbl> 566, 528, 534, 570, …
$ FamilyIncomeBetween80_100KTestTakers              <dbl> 475, 444, 1866, 147,…
$ FamilyIncomeBetween80_100KVerbal                  <dbl> 577, 534, 533, 580, …
$ FamilyIncomeLessThan20KMath                       <dbl> 462, 464, 485, 489, …
$ FamilyIncomeLessThan20KTestTakers                 <dbl> 175, 191, 891, 107, …
$ FamilyIncomeLessThan20KVerbal                     <dbl> 474, 467, 474, 486, …
$ FamilyIncomeMoreThan100KMath                      <dbl> 588, 541, 554, 572, …
$ FamilyIncomeMoreThan100KTestTakers                <dbl> 980, 540, 3083, 314,…
$ FamilyIncomeMoreThan100KVerbal                    <dbl> 590, 544, 546, 589, …
$ GpaAMinusMath                                     <dbl> 569, 544, 541, 559, …
$ GpaAMinusTestTakers                               <dbl> 724, 673, 3334, 298,…
$ GpaAMinusVerbal                                   <dbl> 575, 546, 535, 572, …
$ GpaAPlusMath                                      <dbl> 622, 600, 605, 629, …
$ GpaAPlusTestTakers                                <dbl> 563, 173, 1684, 273,…
$ GpaAPlusVerbal                                    <dbl> 623, 604, 593, 639, …
$ GpaAMath                                          <dbl> 600, 580, 571, 579, …
$ GpaATestTakers                                    <dbl> 1032, 671, 3854, 457…
$ GpaAVerbal                                        <dbl> 608, 578, 563, 583, …
$ GpaBMath                                          <dbl> 514, 492, 498, 492, …
$ GpaBTestTakers                                    <dbl> 1253, 1622, 7193, 43…
$ GpaBVerbal                                        <dbl> 525, 499, 499, 511, …
$ GpaCMath                                          <dbl> 436, 466, 458, 419, …
$ GpaCTestTakers                                    <dbl> 188, 418, 1184, 57, …
$ GpaCVerbal                                        <dbl> 451, 472, 464, 436, …
$ GpaDOrLowerMath                                   <dbl> 0, 424, 439, 0, 419,…
$ GpaDOrLowerTestTakers                             <dbl> 0, 12, 16, 0, 240, 1…
$ GpaDOrLowerVerbal                                 <dbl> 0, 466, 435, 0, 408,…
$ GpaNoResponseMath                                 <dbl> 0, 0, 0, 0, 0, 0, 0,…
$ GpaNoResponseTestTakers                           <dbl> 225, 427, 919, 78, 1…
$ GpaNoResponseVerbal                               <dbl> 0, 0, 0, 0, 0, 0, 0,…
$ GenderFemaleMath                                  <dbl> 538, 505, 513, 536, …
$ GenderFemaleTestTakers                            <dbl> 2072, 2161, 9806, 85…
$ GenderFemaleVerbal                                <dbl> 561, 521, 522, 558, …
$ GenderMaleMath                                    <dbl> 582, 535, 549, 570, …
$ GenderMaleTestTakers                              <dbl> 1913, 1835, 8378, 74…
$ GenderMaleVerbal                                  <dbl> 574, 526, 531, 570, …
$ ScoreRangesBetween200To300MathFemales             <dbl> 22, 30, 119, 12, 297…
$ ScoreRangesBetween200To300MathMales               <dbl> 10, 20, 72, 7, 1453,…
$ ScoreRangesBetween200To300MathTotal               <dbl> 32, 50, 191, 19, 443…
$ ScoreRangesBetween200To300VerbalFemales           <dbl> 14, 26, 115, 9, 3382…
$ ScoreRangesBetween200To300VerbalMales             <dbl> 17, 26, 86, 3, 2433,…
$ ScoreRangesBetween200To300VerbalTotal             <dbl> 31, 52, 201, 12, 581…
$ ScoreRangesBetween300To400MathFemales             <dbl> 173, 233, 881, 68, 1…
$ ScoreRangesBetween300To400MathMales               <dbl> 93, 153, 450, 31, 71…
$ ScoreRangesBetween300To400MathTotal               <dbl> 266, 386, 1331, 99, …
$ ScoreRangesBetween300To400VerbalFemales           <dbl> 123, 218, 739, 46, 1…
$ ScoreRangesBetween300To400VerbalMales             <dbl> 84, 171, 613, 42, 10…
$ ScoreRangesBetween300To400VerbalTotal             <dbl> 207, 389, 1352, 88, …
$ ScoreRangesBetween400To500MathFemales             <dbl> 514, 696, 3215, 210,…
$ ScoreRangesBetween400To500MathMales               <dbl> 293, 485, 1948, 137,…
$ ScoreRangesBetween400To500MathTotal               <dbl> 807, 1181, 5163, 347…
$ ScoreRangesBetween400To500VerbalFemales           <dbl> 430, 656, 3048, 183,…
$ ScoreRangesBetween400To500VerbalMales             <dbl> 332, 552, 2398, 141,…
$ ScoreRangesBetween400To500VerbalTotal             <dbl> 762, 1208, 5446, 324…
$ ScoreRangesBetween500To600MathFemales             <dbl> 722, 813, 3576, 316,…
$ ScoreRangesBetween500To600MathMales               <dbl> 614, 616, 3152, 244,…
$ ScoreRangesBetween500To600MathTotal               <dbl> 1336, 1429, 6728, 56…
$ ScoreRangesBetween500To600VerbalFemales           <dbl> 690, 729, 3661, 302,…
$ ScoreRangesBetween500To600VerbalMales             <dbl> 617, 596, 3101, 236,…
$ ScoreRangesBetween500To600VerbalTotal             <dbl> 1307, 1325, 6762, 53…
$ ScoreRangesBetween600To700MathFemales             <dbl> 485, 342, 1688, 204,…
$ ScoreRangesBetween600To700MathMales               <dbl> 611, 445, 2126, 239,…
$ ScoreRangesBetween600To700MathTotal               <dbl> 1096, 787, 3814, 443…
$ ScoreRangesBetween600To700VerbalFemales           <dbl> 596, 423, 1831, 242,…
$ ScoreRangesBetween600To700VerbalMales             <dbl> 613, 375, 1679, 226,…
$ ScoreRangesBetween600To700VerbalTotal             <dbl> 1209, 798, 3510, 468…
$ ScoreRangesBetween700To800MathFemales             <dbl> 156, 47, 327, 49, 54…
$ ScoreRangesBetween700To800MathMales               <dbl> 292, 116, 630, 83, 8…
$ ScoreRangesBetween700To800MathTotal               <dbl> 448, 163, 957, 132, …
$ ScoreRangesBetween700To800VerbalFemales           <dbl> 219, 109, 412, 77, 5…
$ ScoreRangesBetween700To800VerbalMales             <dbl> 250, 115, 501, 93, 4…
$ ScoreRangesBetween700To800VerbalTotal             <dbl> 469, 224, 913, 170, …
inspect(school_modified)

categorical variables:  
       name  class levels   n missing
1      Year factor     11 577       0
2 StateCode factor     53 577       0
3 StateName factor     53 577       0
                                   distribution
1 2007 (9.2%), 2008 (9.2%) ...                 
2 AK (1.9%), AL (1.9%), AR (1.9%) ...          
3 Alabama (1.9%), Alaska (1.9%) ...            

quantitative variables:  
                                                name   class    min      Q1
1                                          TotalMath numeric 383.00  504.00
2                                    TotalTestTakers numeric 134.00 2536.00
3                                        TotalVerbal numeric 401.00  496.00
4                AcademicSubjectsArtsMusicAverageGpa numeric   3.43    3.76
5              AcademicSubjectsArtsMusicAverageYears numeric   1.20    2.10
6                  AcademicSubjectsEnglishAverageGpa numeric   3.03    3.35
7                AcademicSubjectsEnglishAverageYears numeric   3.50    3.90
8         AcademicSubjectsForeignLanguagesAverageGpa numeric   3.03    3.30
9       AcademicSubjectsForeignLanguagesAverageYears numeric   1.80    2.60
10             AcademicSubjectsMathematicsAverageGpa numeric   2.85    3.12
11           AcademicSubjectsMathematicsAverageYears numeric   3.20    3.80
12         AcademicSubjectsNaturalSciencesAverageGpa numeric   2.87    3.25
13       AcademicSubjectsNaturalSciencesAverageYears numeric   2.80    3.50
14   AcademicSubjectsSocialSciencesHistoryAverageGpa numeric   3.05    3.38
15 AcademicSubjectsSocialSciencesHistoryAverageYears numeric   3.00    3.50
16                     FamilyIncomeBetween20_40KMath numeric   0.00  471.00
17               FamilyIncomeBetween20_40KTestTakers numeric   5.00  214.00
18                   FamilyIncomeBetween20_40KVerbal numeric 387.00  466.00
19                     FamilyIncomeBetween40_60KMath numeric 381.00  493.00
20               FamilyIncomeBetween40_60KTestTakers numeric  10.00  236.00
21                   FamilyIncomeBetween40_60KVerbal numeric 414.00  489.00
22                     FamilyIncomeBetween60_80KMath numeric 249.00  506.00
23               FamilyIncomeBetween60_80KTestTakers numeric   8.00  199.00
24                   FamilyIncomeBetween60_80KVerbal numeric 232.00  501.00
25                    FamilyIncomeBetween80_100KMath numeric 398.00  519.00
26              FamilyIncomeBetween80_100KTestTakers numeric   5.00  164.00
27                  FamilyIncomeBetween80_100KVerbal numeric 433.00  514.00
28                       FamilyIncomeLessThan20KMath numeric   0.00  438.00
29                 FamilyIncomeLessThan20KTestTakers numeric   1.00  124.00
30                     FamilyIncomeLessThan20KVerbal numeric   0.00  429.00
31                      FamilyIncomeMoreThan100KMath numeric   0.00  548.00
32                FamilyIncomeMoreThan100KTestTakers numeric   2.00  427.00
33                    FamilyIncomeMoreThan100KVerbal numeric   0.00  538.00
34                                     GpaAMinusMath numeric   0.00  532.00
35                               GpaAMinusTestTakers numeric   0.00  460.00
36                                   GpaAMinusVerbal numeric   0.00  526.00
37                                      GpaAPlusMath numeric   0.00  607.00
38                                GpaAPlusTestTakers numeric   0.00  274.00
39                                    GpaAPlusVerbal numeric   0.00  595.00
40                                          GpaAMath numeric   0.00  565.00
41                                    GpaATestTakers numeric   0.00  680.00
42                                        GpaAVerbal numeric   0.00  556.00
43                                          GpaBMath numeric   0.00  472.00
44                                    GpaBTestTakers numeric   0.00  676.00
45                                        GpaBVerbal numeric   0.00  470.00
46                                          GpaCMath numeric   0.00  413.00
47                                    GpaCTestTakers numeric   0.00   93.00
48                                        GpaCVerbal numeric   0.00  415.00
49                                   GpaDOrLowerMath numeric   0.00    0.00
50                             GpaDOrLowerTestTakers numeric   0.00    2.00
51                                 GpaDOrLowerVerbal numeric   0.00    0.00
52                                 GpaNoResponseMath numeric   0.00    0.00
53                           GpaNoResponseTestTakers numeric   0.00  107.00
54                               GpaNoResponseVerbal numeric   0.00    0.00
55                                  GenderFemaleMath numeric 368.00  488.00
56                            GenderFemaleTestTakers numeric  73.00 1357.00
57                                GenderFemaleVerbal numeric 399.00  493.00
58                                    GenderMaleMath numeric 394.00  521.00
59                              GenderMaleTestTakers numeric  61.00 1177.00
60                                  GenderMaleVerbal numeric 403.00  499.00
61             ScoreRangesBetween200To300MathFemales numeric   0.00   12.00
62               ScoreRangesBetween200To300MathMales numeric   0.00    8.00
63               ScoreRangesBetween200To300MathTotal numeric   0.00   20.00
64           ScoreRangesBetween200To300VerbalFemales numeric   0.00   12.00
65             ScoreRangesBetween200To300VerbalMales numeric   0.00   14.00
66             ScoreRangesBetween200To300VerbalTotal numeric   0.00   26.00
67             ScoreRangesBetween300To400MathFemales numeric   1.00   95.00
68               ScoreRangesBetween300To400MathMales numeric   1.00   57.00
69               ScoreRangesBetween300To400MathTotal numeric   0.00  149.00
70           ScoreRangesBetween300To400VerbalFemales numeric   1.00   52.00
71             ScoreRangesBetween300To400VerbalMales numeric   1.00   74.00
72             ScoreRangesBetween300To400VerbalTotal numeric   2.00  110.00
73             ScoreRangesBetween400To500MathFemales numeric   0.00  333.00
74               ScoreRangesBetween400To500MathMales numeric   0.00  125.00
75               ScoreRangesBetween400To500MathTotal numeric   0.00  493.00
76           ScoreRangesBetween400To500VerbalFemales numeric   1.00  198.00
77             ScoreRangesBetween400To500VerbalMales numeric   2.00  223.00
78             ScoreRangesBetween400To500VerbalTotal numeric   0.00  354.00
79             ScoreRangesBetween500To600MathFemales numeric   4.00  369.00
80               ScoreRangesBetween500To600MathMales numeric   3.00  292.00
81               ScoreRangesBetween500To600MathTotal numeric   6.00  651.00
82           ScoreRangesBetween500To600VerbalFemales numeric   4.00  356.00
83             ScoreRangesBetween500To600VerbalMales numeric   4.00  318.00
84             ScoreRangesBetween500To600VerbalTotal numeric   1.00  663.00
85             ScoreRangesBetween600To700MathFemales numeric  10.00  284.00
86               ScoreRangesBetween600To700MathMales numeric  15.00  328.00
87               ScoreRangesBetween600To700MathTotal numeric  26.00  609.00
88           ScoreRangesBetween600To700VerbalFemales numeric  13.00  319.00
89             ScoreRangesBetween600To700VerbalMales numeric  10.00  302.00
90             ScoreRangesBetween600To700VerbalTotal numeric  23.00  617.00
91             ScoreRangesBetween700To800MathFemales numeric   2.00   83.00
92               ScoreRangesBetween700To800MathMales numeric   1.00  163.00
93               ScoreRangesBetween700To800MathTotal numeric   1.00  251.00
94           ScoreRangesBetween700To800VerbalFemales numeric   2.00  123.00
95             ScoreRangesBetween700To800VerbalMales numeric   2.00  121.00
96             ScoreRangesBetween700To800VerbalTotal numeric   4.00  246.00
    median       Q3       max         mean           sd   n missing
1   527.00   571.00    619.00   535.682842 4.617161e+01 577       0
2  6468.00 35799.00 241553.00 27914.242634 4.560211e+04 577       0
3   522.00   572.00    612.00   531.334489 4.431830e+01 577       0
4     3.85     3.90      3.96     3.822704 9.324943e-02 577       0
5     2.30     2.50      3.10     2.288735 3.191699e-01 577       0
6     3.51     3.67      3.88     3.500953 1.855612e-01 577       0
7     3.90     4.00      4.10     3.929463 9.297488e-02 577       0
8     3.46     3.63      3.79     3.453345 1.891072e-01 577       0
9     2.80     3.10      3.60     2.850953 3.447069e-01 577       0
10    3.30     3.51      3.76     3.310312 2.152249e-01 577       0
11    3.90     4.10      4.40     3.939341 1.682062e-01 577       0
12    3.42     3.60      3.82     3.418180 1.978311e-01 577       0
13    3.60     3.80      4.20     3.631889 2.031575e-01 577       0
14    3.53     3.68      3.88     3.522166 1.775733e-01 577       0
15    3.60     3.70      4.00     3.618718 1.818971e-01 577       0
16  495.00   533.00    643.00   500.239168 4.845665e+01 577       0
17  580.00  3179.00  35446.00  3234.175043 5.935047e+03 577       0
18  496.00   536.00    634.00   501.327556 4.381485e+01 577       0
19  519.00   554.00    629.00   522.844021 4.307938e+01 577       0
20  636.00  3386.00  28124.00  2847.201040 4.638143e+03 577       0
21  517.00   558.00    628.00   523.083189 4.194743e+01 577       0
22  531.00   564.00    630.00   533.611785 4.250284e+01 577       0
23  523.00  2791.00  17937.00  2269.755633 3.535326e+03 577       0
24  527.00   571.00    645.00   533.845754 4.342085e+01 577       0
25  539.00   575.00    646.00   547.055459 3.836915e+01 577       0
26  441.00  2130.00  15358.00  1803.634315 2.855647e+03 577       0
27  536.00   580.00    651.00   544.582322 3.807276e+01 577       0
28  465.00   510.00    589.00   461.734835 7.924915e+01 577       0
29  347.00  2129.00  42551.00  2433.318891 5.254028e+03 577       0
30  464.00   501.00    579.00   458.433276 6.702153e+01 577       0
31  565.00   587.00    637.00   565.830156 4.213576e+01 577       0
32 1000.00  4405.00  46127.00  4141.346620 7.004940e+03 577       0
33  555.00   590.00    637.00   560.492201 4.258815e+01 577       0
34  552.00   569.00    619.00   550.032929 3.899533e+01 577       0
35 1217.00  6372.00  45869.00  4954.838821 8.126753e+03 577       0
36  547.00   566.00    609.00   543.720971 3.653613e+01 577       0
37  624.00   644.00    683.00   621.700173 4.297317e+01 577       0
38  524.00  1792.00  12184.00  1592.202773 2.384094e+03 577       0
39  616.00   637.00    672.00   613.051993 4.143910e+01 577       0
40  588.00   605.00    655.00   584.989601 4.178994e+01 577       0
41 1390.00  6112.00  42656.00  4925.415945 7.645594e+03 577       0
42  580.00   600.00    637.00   576.849220 3.953681e+01 577       0
43  492.00   511.00    564.00   490.497400 3.776550e+01 577       0
44 2282.00 14745.00 104693.00 11728.670711 1.992444e+04 577       0
45  493.00   517.00    562.00   490.984402 3.658501e+01 577       0
46  436.00   457.00    553.00   426.221837 7.059629e+01 577       0
47  445.00  3060.00  22802.00  2619.736568 4.602100e+03 577       0
48  440.00   464.00    548.00   431.551127 7.161896e+01 577       0
49  389.00   428.00    648.00   266.636049 2.092454e+02 577       0
50   12.00    90.00   2061.00    90.762565 2.353662e+02 577       0
51  394.00   432.00    632.00   272.357019 2.111320e+02 577       0
52  446.00   496.00    589.00   304.010399 2.363443e+02 577       0
53  399.00  2206.00  26744.00  1953.188908 3.595058e+03 577       0
54  462.00   510.00    616.00   315.561525 2.456963e+02 577       0
55  510.00   551.00    611.00   518.415945 4.421719e+01 577       0
56 3428.00 18698.00 133217.00 15011.610052 2.466780e+04 577       0
57  519.00   569.00    611.00   528.348354 4.347032e+01 577       0
58  546.00   592.00    640.00   553.911612 4.838629e+01 577       0
59 2979.00 16718.00 108336.00 12911.246101 2.095955e+04 577       0
60  525.00   577.00    635.00   534.883882 4.541287e+01 577       0
61  102.00   518.00   4294.00   441.296360 7.794464e+02 577       0
62   60.00   468.00   3034.00   308.422877 5.094988e+02 577       0
63  162.00   628.00   6772.00   705.206239 1.281787e+03 577       0
64   46.00   357.00   5111.00   387.736568 7.933545e+02 577       0
65   83.00   570.00  20348.00   651.103986 1.830084e+03 577       0
66   74.00   790.00  10603.00   752.656846 1.514439e+03 577       0
67  599.00  1972.00  24977.00  2180.500867 3.961850e+03 577       0
68  144.00  1389.00  13740.00  1278.972270 2.396515e+03 577       0
69  937.00  3288.00  38161.00  3450.223570 6.353179e+03 577       0
70  206.00  1827.00  22544.00  2017.923744 4.037559e+03 577       0
71  368.00  2081.00  26188.00  1956.982669 3.578360e+03 577       0
72  394.00  3530.00  41262.00  3669.703640 7.235214e+03 577       0
73  670.00  5262.00  43758.00  4597.649913 8.104309e+03 577       0
74  442.00  3412.00  29254.00  3142.771231 5.642535e+03 577       0
75 1096.00  8698.00  73012.00  7737.701906 1.374863e+04 577       0
76  595.00  5082.00  45918.00  4538.656846 8.240606e+03 577       0
77  892.00  5351.00 164622.00  5540.672444 1.313533e+04 577       0
78 1063.00  9280.00  80535.00  8190.334489 1.483361e+04 577       0
79 1006.00  5593.00  35778.00  4332.811092 6.939657e+03 577       0
80  798.00  5163.00  31702.00  3790.287695 6.212880e+03 577       0
81 1830.00 10753.00  67480.00  8125.038128 1.313957e+04 577       0
82  995.00  5835.00  37455.00  4350.618718 6.945720e+03 577       0
83  871.00  5341.00  31449.00  3760.083189 6.010780e+03 577       0
84 1888.00 11266.00  68869.00  8112.571924 1.295497e+04 577       0
85  699.00  3242.00  66431.00  2888.339688 5.908861e+03 577       0
86  732.00  4178.00  49941.00  3166.878683 5.530240e+03 577       0
87 1462.00  7308.00 116372.00  6055.634315 1.135802e+04 577       0
88  718.00  3329.00  71360.00  2914.299827 5.949387e+03 577       0
89  672.00  3215.00  56513.00  2677.575390 5.129549e+03 577       0
90 1383.00  6521.00 127873.00  5592.358752 1.106917e+04 577       0
91  223.00   821.00  24126.00   792.625650 1.787202e+03 577       0
92  406.00  1475.00  30815.00  1306.644714 2.557591e+03 577       0
93  645.00  2301.00  54941.00  2099.256499 4.334327e+03 577       0
94  295.00   987.00  21826.00   849.818024 1.665755e+03 577       0
95  295.00   970.00  20460.00   847.265165 1.625229e+03 577       0
96  605.00  1971.00  42286.00  1697.116118 3.289674e+03 577       0
skim(school_modified)
Data summary
Name school_modified
Number of rows 577
Number of columns 99
_______________________
Column type frequency:
factor 3
numeric 96
________________________
Group variables None

Variable type: factor

skim_variable n_missing complete_rate ordered n_unique top_counts
Year 0 1 FALSE 11 200: 53, 200: 53, 201: 53, 201: 53
StateCode 0 1 FALSE 53 AK: 11, AL: 11, AR: 11, AZ: 11
StateName 0 1 FALSE 53 Ala: 11, Ala: 11, Ari: 11, Ark: 11

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
TotalMath 0 1 535.68 46.17 383.00 504.00 527.00 571.00 619.00 ▁▁▇▆▅
TotalTestTakers 0 1 27914.24 45602.11 134.00 2536.00 6468.00 35799.00 241553.00 ▇▁▁▁▁
TotalVerbal 0 1 531.33 44.32 401.00 496.00 522.00 572.00 612.00 ▁▂▇▃▅
AcademicSubjectsArtsMusicAverageGpa 0 1 3.82 0.09 3.43 3.76 3.85 3.90 3.96 ▁▁▂▆▇
AcademicSubjectsArtsMusicAverageYears 0 1 2.29 0.32 1.20 2.10 2.30 2.50 3.10 ▁▂▇▅▂
AcademicSubjectsEnglishAverageGpa 0 1 3.50 0.19 3.03 3.35 3.51 3.67 3.88 ▂▆▇▇▃
AcademicSubjectsEnglishAverageYears 0 1 3.93 0.09 3.50 3.90 3.90 4.00 4.10 ▁▁▂▇▇
AcademicSubjectsForeignLanguagesAverageGpa 0 1 3.45 0.19 3.03 3.30 3.46 3.63 3.79 ▃▇▇▇▇
AcademicSubjectsForeignLanguagesAverageYears 0 1 2.85 0.34 1.80 2.60 2.80 3.10 3.60 ▁▅▆▇▃
AcademicSubjectsMathematicsAverageGpa 0 1 3.31 0.22 2.85 3.12 3.30 3.51 3.76 ▂▇▅▇▃
AcademicSubjectsMathematicsAverageYears 0 1 3.94 0.17 3.20 3.80 3.90 4.10 4.40 ▁▁▇▆▂
AcademicSubjectsNaturalSciencesAverageGpa 0 1 3.42 0.20 2.87 3.25 3.42 3.60 3.82 ▁▆▇▇▅
AcademicSubjectsNaturalSciencesAverageYears 0 1 3.63 0.20 2.80 3.50 3.60 3.80 4.20 ▁▁▇▇▁
AcademicSubjectsSocialSciencesHistoryAverageGpa 0 1 3.52 0.18 3.05 3.38 3.53 3.68 3.88 ▁▆▆▇▅
AcademicSubjectsSocialSciencesHistoryAverageYears 0 1 3.62 0.18 3.00 3.50 3.60 3.70 4.00 ▁▃▇▇▂
FamilyIncomeBetween20_40KMath 0 1 500.24 48.46 0.00 471.00 495.00 533.00 643.00 ▁▁▁▇▅
FamilyIncomeBetween20_40KTestTakers 0 1 3234.18 5935.05 5.00 214.00 580.00 3179.00 35446.00 ▇▁▁▁▁
FamilyIncomeBetween20_40KVerbal 0 1 501.33 43.81 387.00 466.00 496.00 536.00 634.00 ▁▇▇▅▁
FamilyIncomeBetween40_60KMath 0 1 522.84 43.08 381.00 493.00 519.00 554.00 629.00 ▁▂▇▆▂
FamilyIncomeBetween40_60KTestTakers 0 1 2847.20 4638.14 10.00 236.00 636.00 3386.00 28124.00 ▇▁▁▁▁
FamilyIncomeBetween40_60KVerbal 0 1 523.08 41.95 414.00 489.00 517.00 558.00 628.00 ▁▇▇▇▂
FamilyIncomeBetween60_80KMath 0 1 533.61 42.50 249.00 506.00 531.00 564.00 630.00 ▁▁▁▇▅
FamilyIncomeBetween60_80KTestTakers 0 1 2269.76 3535.33 8.00 199.00 523.00 2791.00 17937.00 ▇▁▁▁▁
FamilyIncomeBetween60_80KVerbal 0 1 533.85 43.42 232.00 501.00 527.00 571.00 645.00 ▁▁▁▇▅
FamilyIncomeBetween80_100KMath 0 1 547.06 38.37 398.00 519.00 539.00 575.00 646.00 ▁▁▇▅▂
FamilyIncomeBetween80_100KTestTakers 0 1 1803.63 2855.65 5.00 164.00 441.00 2130.00 15358.00 ▇▁▁▁▁
FamilyIncomeBetween80_100KVerbal 0 1 544.58 38.07 433.00 514.00 536.00 580.00 651.00 ▁▇▇▇▁
FamilyIncomeLessThan20KMath 0 1 461.73 79.25 0.00 438.00 465.00 510.00 589.00 ▁▁▁▇▇
FamilyIncomeLessThan20KTestTakers 0 1 2433.32 5254.03 1.00 124.00 347.00 2129.00 42551.00 ▇▁▁▁▁
FamilyIncomeLessThan20KVerbal 0 1 458.43 67.02 0.00 429.00 464.00 501.00 579.00 ▁▁▁▇▇
FamilyIncomeMoreThan100KMath 0 1 565.83 42.14 0.00 548.00 565.00 587.00 637.00 ▁▁▁▁▇
FamilyIncomeMoreThan100KTestTakers 0 1 4141.35 7004.94 2.00 427.00 1000.00 4405.00 46127.00 ▇▁▁▁▁
FamilyIncomeMoreThan100KVerbal 0 1 560.49 42.59 0.00 538.00 555.00 590.00 637.00 ▁▁▁▁▇
GpaAMinusMath 0 1 550.03 39.00 0.00 532.00 552.00 569.00 619.00 ▁▁▁▁▇
GpaAMinusTestTakers 0 1 4954.84 8126.75 0.00 460.00 1217.00 6372.00 45869.00 ▇▁▁▁▁
GpaAMinusVerbal 0 1 543.72 36.54 0.00 526.00 547.00 566.00 609.00 ▁▁▁▁▇
GpaAPlusMath 0 1 621.70 42.97 0.00 607.00 624.00 644.00 683.00 ▁▁▁▁▇
GpaAPlusTestTakers 0 1 1592.20 2384.09 0.00 274.00 524.00 1792.00 12184.00 ▇▁▁▁▁
GpaAPlusVerbal 0 1 613.05 41.44 0.00 595.00 616.00 637.00 672.00 ▁▁▁▁▇
GpaAMath 0 1 584.99 41.79 0.00 565.00 588.00 605.00 655.00 ▁▁▁▁▇
GpaATestTakers 0 1 4925.42 7645.59 0.00 680.00 1390.00 6112.00 42656.00 ▇▁▁▁▁
GpaAVerbal 0 1 576.85 39.54 0.00 556.00 580.00 600.00 637.00 ▁▁▁▁▇
GpaBMath 0 1 490.50 37.77 0.00 472.00 492.00 511.00 564.00 ▁▁▁▁▇
GpaBTestTakers 0 1 11728.67 19924.44 0.00 676.00 2282.00 14745.00 104693.00 ▇▁▁▁▁
GpaBVerbal 0 1 490.98 36.59 0.00 470.00 493.00 517.00 562.00 ▁▁▁▁▇
GpaCMath 0 1 426.22 70.60 0.00 413.00 436.00 457.00 553.00 ▁▁▁▇▆
GpaCTestTakers 0 1 2619.74 4602.10 0.00 93.00 445.00 3060.00 22802.00 ▇▂▁▁▁
GpaCVerbal 0 1 431.55 71.62 0.00 415.00 440.00 464.00 548.00 ▁▁▁▇▇
GpaDOrLowerMath 0 1 266.64 209.25 0.00 0.00 389.00 428.00 648.00 ▆▁▂▇▁
GpaDOrLowerTestTakers 0 1 90.76 235.37 0.00 2.00 12.00 90.00 2061.00 ▇▁▁▁▁
GpaDOrLowerVerbal 0 1 272.36 211.13 0.00 0.00 394.00 432.00 632.00 ▆▁▁▇▁
GpaNoResponseMath 0 1 304.01 236.34 0.00 0.00 446.00 496.00 589.00 ▇▁▁▆▇
GpaNoResponseTestTakers 0 1 1953.19 3595.06 0.00 107.00 399.00 2206.00 26744.00 ▇▁▁▁▁
GpaNoResponseVerbal 0 1 315.56 245.70 0.00 0.00 462.00 510.00 616.00 ▇▁▁▆▇
GenderFemaleMath 0 1 518.42 44.22 368.00 488.00 510.00 551.00 611.00 ▁▁▇▅▃
GenderFemaleTestTakers 0 1 15011.61 24667.80 73.00 1357.00 3428.00 18698.00 133217.00 ▇▁▁▁▁
GenderFemaleVerbal 0 1 528.35 43.47 399.00 493.00 519.00 569.00 611.00 ▁▂▇▃▅
GenderMaleMath 0 1 553.91 48.39 394.00 521.00 546.00 592.00 640.00 ▁▁▇▆▅
GenderMaleTestTakers 0 1 12911.25 20959.55 61.00 1177.00 2979.00 16718.00 108336.00 ▇▁▁▁▁
GenderMaleVerbal 0 1 534.88 45.41 403.00 499.00 525.00 577.00 635.00 ▁▃▇▆▃
ScoreRangesBetween200To300MathFemales 0 1 441.30 779.45 0.00 12.00 102.00 518.00 4294.00 ▇▁▁▁▁
ScoreRangesBetween200To300MathMales 0 1 308.42 509.50 0.00 8.00 60.00 468.00 3034.00 ▇▁▁▁▁
ScoreRangesBetween200To300MathTotal 0 1 705.21 1281.79 0.00 20.00 162.00 628.00 6772.00 ▇▁▁▁▁
ScoreRangesBetween200To300VerbalFemales 0 1 387.74 793.35 0.00 12.00 46.00 357.00 5111.00 ▇▁▁▁▁
ScoreRangesBetween200To300VerbalMales 0 1 651.10 1830.08 0.00 14.00 83.00 570.00 20348.00 ▇▁▁▁▁
ScoreRangesBetween200To300VerbalTotal 0 1 752.66 1514.44 0.00 26.00 74.00 790.00 10603.00 ▇▁▁▁▁
ScoreRangesBetween300To400MathFemales 0 1 2180.50 3961.85 1.00 95.00 599.00 1972.00 24977.00 ▇▁▁▁▁
ScoreRangesBetween300To400MathMales 0 1 1278.97 2396.51 1.00 57.00 144.00 1389.00 13740.00 ▇▁▁▁▁
ScoreRangesBetween300To400MathTotal 0 1 3450.22 6353.18 0.00 149.00 937.00 3288.00 38161.00 ▇▁▁▁▁
ScoreRangesBetween300To400VerbalFemales 0 1 2017.92 4037.56 1.00 52.00 206.00 1827.00 22544.00 ▇▁▁▁▁
ScoreRangesBetween300To400VerbalMales 0 1 1956.98 3578.36 1.00 74.00 368.00 2081.00 26188.00 ▇▁▁▁▁
ScoreRangesBetween300To400VerbalTotal 0 1 3669.70 7235.21 2.00 110.00 394.00 3530.00 41262.00 ▇▁▁▁▁
ScoreRangesBetween400To500MathFemales 0 1 4597.65 8104.31 0.00 333.00 670.00 5262.00 43758.00 ▇▁▁▁▁
ScoreRangesBetween400To500MathMales 0 1 3142.77 5642.54 0.00 125.00 442.00 3412.00 29254.00 ▇▁▁▁▁
ScoreRangesBetween400To500MathTotal 0 1 7737.70 13748.63 0.00 493.00 1096.00 8698.00 73012.00 ▇▁▁▁▁
ScoreRangesBetween400To500VerbalFemales 0 1 4538.66 8240.61 1.00 198.00 595.00 5082.00 45918.00 ▇▁▁▁▁
ScoreRangesBetween400To500VerbalMales 0 1 5540.67 13135.33 2.00 223.00 892.00 5351.00 164622.00 ▇▁▁▁▁
ScoreRangesBetween400To500VerbalTotal 0 1 8190.33 14833.61 0.00 354.00 1063.00 9280.00 80535.00 ▇▁▁▁▁
ScoreRangesBetween500To600MathFemales 0 1 4332.81 6939.66 4.00 369.00 1006.00 5593.00 35778.00 ▇▁▁▁▁
ScoreRangesBetween500To600MathMales 0 1 3790.29 6212.88 3.00 292.00 798.00 5163.00 31702.00 ▇▁▁▁▁
ScoreRangesBetween500To600MathTotal 0 1 8125.04 13139.57 6.00 651.00 1830.00 10753.00 67480.00 ▇▁▁▁▁
ScoreRangesBetween500To600VerbalFemales 0 1 4350.62 6945.72 4.00 356.00 995.00 5835.00 37455.00 ▇▁▁▁▁
ScoreRangesBetween500To600VerbalMales 0 1 3760.08 6010.78 4.00 318.00 871.00 5341.00 31449.00 ▇▁▁▁▁
ScoreRangesBetween500To600VerbalTotal 0 1 8112.57 12954.97 1.00 663.00 1888.00 11266.00 68869.00 ▇▁▁▁▁
ScoreRangesBetween600To700MathFemales 0 1 2888.34 5908.86 10.00 284.00 699.00 3242.00 66431.00 ▇▁▁▁▁
ScoreRangesBetween600To700MathMales 0 1 3166.88 5530.24 15.00 328.00 732.00 4178.00 49941.00 ▇▁▁▁▁
ScoreRangesBetween600To700MathTotal 0 1 6055.63 11358.02 26.00 609.00 1462.00 7308.00 116372.00 ▇▁▁▁▁
ScoreRangesBetween600To700VerbalFemales 0 1 2914.30 5949.39 13.00 319.00 718.00 3329.00 71360.00 ▇▁▁▁▁
ScoreRangesBetween600To700VerbalMales 0 1 2677.58 5129.55 10.00 302.00 672.00 3215.00 56513.00 ▇▁▁▁▁
ScoreRangesBetween600To700VerbalTotal 0 1 5592.36 11069.17 23.00 617.00 1383.00 6521.00 127873.00 ▇▁▁▁▁
ScoreRangesBetween700To800MathFemales 0 1 792.63 1787.20 2.00 83.00 223.00 821.00 24126.00 ▇▁▁▁▁
ScoreRangesBetween700To800MathMales 0 1 1306.64 2557.59 1.00 163.00 406.00 1475.00 30815.00 ▇▁▁▁▁
ScoreRangesBetween700To800MathTotal 0 1 2099.26 4334.33 1.00 251.00 645.00 2301.00 54941.00 ▇▁▁▁▁
ScoreRangesBetween700To800VerbalFemales 0 1 849.82 1665.76 2.00 123.00 295.00 987.00 21826.00 ▇▁▁▁▁
ScoreRangesBetween700To800VerbalMales 0 1 847.27 1625.23 2.00 121.00 295.00 970.00 20460.00 ▇▁▁▁▁
ScoreRangesBetween700To800VerbalTotal 0 1 1697.12 3289.67 4.00 246.00 605.00 1971.00 42286.00 ▇▁▁▁▁

Target Variables

  • TotalMath: If you want to predict math performance scores, this could be the target variable.

  • TotalVerbal: If you aim to predict verbal performance scores, this could be the target.

Predictor Variables

  • StateName: This indicates the specific state where students are located, allowing analysis of how different regions influence test performance.

  • StateCode: This shows the state where students are from, helping to see how location affects test scores.

  • TotalTestTakers: This counts how many students took the test, giving insight into the testing situation.

  • GenderFemaleMath / GenderMaleMath / GenderFemaleVerbal / GenderMaleVerbal: These show how well boys and girls did in math and verbal sections, highlighting gender differences in performance.

  • Family_income: This indicates the income level of students’ families, helping to see how money affects test scores.

  • Score Ranges: This groups students by their test scores, helping to understand how many students perform at different levels.

  • Academic Subjects Average GPA: This shows students’ overall grades, which can predict how well they do on tests.

Analyzing the qualitative data

StateName

school_modified%>% count(StateName)
# A tibble: 53 × 2
   StateName                n
   <fct>                <int>
 1 Alabama                 11
 2 Alaska                  11
 3 Arizona                 11
 4 Arkansas                11
 5 California              11
 6 Colorado                11
 7 Connecticut             11
 8 Delaware                11
 9 District Of Columbia    11
10 Florida                 11
# ℹ 43 more rows

Observations

It takes a count of test takers from various states. Most states, including New Hampshire, New Jersey, and New York, have 11 observations each. However, Puerto Rico, Virgin Islands and West Virginia have only 9 observations. This uniformity in the number of observations suggests that data collection was relatively consistent across most states, allowing for reliable comparisons.

StateCode

school_modified%>% count(StateCode)
# A tibble: 53 × 2
   StateCode     n
   <fct>     <int>
 1 AK           11
 2 AL           11
 3 AR           11
 4 AZ           11
 5 CA           11
 6 CO           11
 7 CT           11
 8 DC           11
 9 DE           11
10 FL           11
# ℹ 43 more rows

Observations

This analysis does the same as the previous one but focuses on the state codes instead of state names. This allows for a more standardized comparison, as state codes are often used in datasets to represent geographical regions.

Year

school_modified%>% count(Year)
# A tibble: 11 × 2
   Year      n
   <fct> <int>
 1 2005     52
 2 2006     52
 3 2007     53
 4 2008     53
 5 2009     51
 6 2010     51
 7 2011     53
 8 2012     53
 9 2013     53
10 2014     53
11 2015     53

Observations

The dataset covers years from 2005 to 2014, with varying numbers of observations: 53 observations for 2007, 2008, 2011, 2012, 2013, 2014 and 2015 52 observations for the years 2005, 2006 51 observations for 2009 and 2010 These changes in the number of observations each year might indicate differences in data collection methods or participation rates, with the highest number of observations occurring after 2010.

Analyzing the quantitative data

school_modified
# A tibble: 577 × 99
   Year  StateCode StateName            TotalMath TotalTestTakers TotalVerbal
   <fct> <fct>     <fct>                    <dbl>           <dbl>       <dbl>
 1 2005  AL        Alabama                    559            3985         567
 2 2005  AK        Alaska                     519            3996         523
 3 2005  AZ        Arizona                    530           18184         526
 4 2005  AR        Arkansas                   552            1600         563
 5 2005  CA        California                 522          186552         504
 6 2005  CO        Colorado                   560           11990         560
 7 2005  CT        Connecticut                517           34313         517
 8 2005  DE        Delaware                   502            6257         503
 9 2005  DC        District Of Columbia       478            3622         490
10 2005  FL        Florida                    498           93505         498
# ℹ 567 more rows
# ℹ 93 more variables: AcademicSubjectsArtsMusicAverageGpa <dbl>,
#   AcademicSubjectsArtsMusicAverageYears <dbl>,
#   AcademicSubjectsEnglishAverageGpa <dbl>,
#   AcademicSubjectsEnglishAverageYears <dbl>,
#   AcademicSubjectsForeignLanguagesAverageGpa <dbl>,
#   AcademicSubjectsForeignLanguagesAverageYears <dbl>, …

FamilyIncomeLessThan20KMath

gf_histogram(~FamilyIncomeLessThan20KMath, data = school_modified)

Observations

The histogram for students from families earning less than $20,000 shows that most math scores are higher, especially around 400 and above. This indicates that many students are doing well in math, indicating a supportive educational environment. The few students scoring much lower create a small left tail, making the distribution left skewed.

FamilyIncomeBetween20_40KMath

gf_histogram(~FamilyIncomeBetween20_40KMath , data = school_modified)

Observations

It shows that most math scores are grouped around the higher range, especially near 400. This means many students in this income group do well in math. Similar to the previous group, there are only a few students with much lower scores, creating a left skew in the data. While some students scored lower, a significant number are achieving high math scores despite having moderate family incomes.

FamilyIncomeBetween40_60KMath

gf_histogram(~FamilyIncomeBetween40_60KMath , data = school_modified)

Observations

shows that most math scores are concentrated around 500. This indicates that many students in this income group perform well in math. Unlike the previous income groups, there is a more balanced distribution of scores, with fewer students scoring very low and a significant number scoring higher. This results in a slight right skew, meaning the tail on the right side of the distribution is longer than the left. This suggests that while many students excel, there are still some lower scores present.

FamilyIncomeBetween60_80KMath

gf_histogram(~FamilyIncomeBetween60_80KMath , data = school_modified)

Observations

The majority of students math scores fall within the range of 450 to 550, with a peak around 500, indicating that most students in this income group perform at a similar level. There are fewer students scoring in the lower ranges, below 400. The histogram also appears to be left-skewed, as the tail on the left side is longer, showing that a small number of students have much lower scores while the majority have higher scores.

FamilyIncomeBetween80_100KMath

gf_histogram(~FamilyIncomeBetween80_100KMath , data = school_modified)

Observations

The distribution of math scores is centered around 500. The highest count is around this central value, with the number of observations gradually decreasing on both sides. There are a few scores below 450 and above 600, indicating a relatively balanced distribution. However, the shape is slightly right-skewed, as the tail extends more to the right after the central peak, suggesting that a small number of students scored higher than most others.

FamilyIncomeMoreThan100KMath

gf_histogram(~FamilyIncomeMoreThan100KMath , data = school_modified)

Observations

The distribution of math scores is also centered around 600, but this distribution is more concentrated with fewer observations spread across the range. The highest count occurs around the 600 mark, with the number of scores sharply dropping off before 500. This distribution is left-skewed, meaning that most students score on the higher end, while only a small number of students have lower scores, with very few scoring below 400. The data is more concentrated at the higher scores, with a long tail extending towards the lower scores.

Research Questions

  1. How does family income level affect the average test scores of students in different states?

  2. Is there a significant difference in test performance between male and female students across various academic subjects?

  3. What is the relationship between the number of total test takers and the average GPA in different states?

Plot: All Subjects

subjects <- school_modified %>% 
  rename(
    ArtsMusic = AcademicSubjectsArtsMusicAverageGpa,
    English = AcademicSubjectsEnglishAverageGpa ,
    ForeignLanguages = AcademicSubjectsForeignLanguagesAverageGpa ,
    Mathematics = AcademicSubjectsMathematicsAverageGpa,
    NaturalSciences = AcademicSubjectsNaturalSciencesAverageGpa ,
    SocialSciencesHistory = AcademicSubjectsSocialSciencesHistoryAverageGpa
  ) %>%
  select(ArtsMusic,English,ForeignLanguages,Mathematics,NaturalSciences,SocialSciencesHistory)
subjects
# A tibble: 577 × 6
   ArtsMusic English ForeignLanguages Mathematics NaturalSciences
       <dbl>   <dbl>            <dbl>       <dbl>           <dbl>
 1      3.92    3.53             3.54        3.41            3.52
 2      3.76    3.35             3.34        3.06            3.25
 3      3.85    3.45             3.41        3.25            3.43
 4      3.9     3.61             3.64        3.46            3.55
 5      3.76    3.32             3.29        3.05            3.2 
 6      3.88    3.49             3.41        3.33            3.43
 7      3.66    3.13             3.03        3               3.07
 8      3.71    3.21             3.18        3.07            3.19
 9      3.54    3.03             3.04        2.91            2.99
10      3.77    3.29             3.3         3.07            3.27
# ℹ 567 more rows
# ℹ 1 more variable: SocialSciencesHistory <dbl>
GGally::ggpairs(
  subjects %>% drop_na(),
  columns = c("ArtsMusic", "English", "ForeignLanguages", "Mathematics", "NaturalSciences", "SocialSciencesHistory"),
  switch = "both",
  progress = FALSE,
  diag = list(continuous = "densityDiag"),
  lower = list(continuous = wrap("smooth", alpha = 0.3, se = FALSE)),
  title = "Academic Scores Correlation Plot"
)

Plot Analysis

  1. Type of Charts.

    The chart has three different types of visuals. First, the scatter plot shows how students scores in different subjects relate to each other, making it easy to see how well a score in one subject goes along with a score in another. Second, the density plot gives a smooth view of how scores are spread out, showing where most students scores are concentrated. Lastly, the correlation matrix shows numbers that indicate how strongly the scores in different subjects are connected, helping to understand how students perform across subjects.

  2. Variables Used for Various Geometrical Aspects

    X-axis and Y-axis Variables:

    ArtsMusic

    English

    ForeignLanguages

    Mathematics

    NaturalSciences

    SocialSciencesHistory

    Each subject listed in the columns will have its scores plotted along both axes during the pairwise comparisons.

  3. What activity might have been carried out to obtain the data graphed here?

    To gather the data shown in the Academic Scores in Different Subjects chart, students likely took part in a structured assessment in their schools. This could have included tests given to students in different grades or classes. They probably completed standardized tests in subjects like Mathematics, Natural Sciences, Social Sciences, English, Foreign Languages, and Arts Music.

  4. Hypothesis/Research Question

    What is the relationship between the students scores in different subjects and how does what is the overall distribution of these scores?

  5. Two-Line Story Based on the Graph

    It’s surprising to observe a strong positive correlation between Math and Science scores, as well as between English and Science and English and Social Science scores. This suggests that students who excel in one subject often do well in others. A key takeaway is that while many students achieve above-average scores, a significant number still face challenges, indicating a need for support to help those struggling improve their academic performance.

Plot: Maths vs Family Income

name_data <- school_modified %>%
  select(
    FamilyIncomeBetween40_60KMath,
    FamilyIncomeBetween60_80KMath,
    FamilyIncomeBetween80_100KMath,
    FamilyIncomeLessThan20KMath,
    FamilyIncomeBetween20_40KMath,
    FamilyIncomeMoreThan100KMath
  ) %>%
  pivot_longer(
    cols = everything(),
    names_to = "names",
    values_to = "values"
  ) %>%
  mutate(names = case_when(
    names == "FamilyIncomeLessThan20KMath" ~ "LessThan20K",
    names == "FamilyIncomeBetween20_40KMath" ~ "Between20_40K",
    names == "FamilyIncomeBetween40_60KMath" ~ "Between40_60K",
    names == "FamilyIncomeBetween60_80KMath" ~ "Between60_80K",
    names == "FamilyIncomeBetween80_100KMath" ~ "Between80_100K",
    names == "FamilyIncomeMoreThan100KMath" ~ "MoreThan100K"
  )) 
name_data <- name_data %>%
  select(names, values) %>%
  dplyr::mutate(
    names = factor(names,
      levels = c("LessThan20K", "Between20_40K", "Between40_60K","Between60_80K", "Between80_100K", "MoreThan100K"),
      labels = c("LessThan20K", "Between20_40K", "Between40_60K","Between60_80K", "Between80_100K", "MoreThan100K"),
      ordered = TRUE
    )
  )
name_data
# A tibble: 3,462 × 2
   names          values
   <ord>           <dbl>
 1 Between40_60K     539
 2 Between60_80K     550
 3 Between80_100K    566
 4 LessThan20K       462
 5 Between20_40K     513
 6 MoreThan100K      588
 7 Between40_60K     517
 8 Between60_80K     513
 9 Between80_100K    528
10 LessThan20K       464
# ℹ 3,452 more rows
name_data %>%
  gf_boxplot(reorder(names, values, FUN = median) ~ values) %>%
  gf_labs(title = "Maths vs Family Income",
          x = "Scores In Maths",
          y = "Income Class")

Plot with specified colours

name_data %>%
  gf_boxplot(reorder(names, values, FUN = median) ~ values,
             fill = ~names,
             alpha = 0.8) %>%
  gf_labs(title = "Maths Scores vs Family Income",
          x = "Scores In Maths",
          y = "Income Class") %>%
  gf_refine(scale_fill_manual(values = c("gray80", "gray65", "gray50", "gray40", "gray25", "gray10")))

At first, I struggled to replicate the exact colors from the website. However, adjusting the opacity with alpha improved the color matching. This adjustment helped me get as close as possible to the necessary colors.

  1. Type of Charts.

    Box Plot: It compares math scores across different income groups, showing the differences in average scores (median) and the range of scores (spread) for each group. This chart helps us see how math scores change with income levels and whether higher incomes are linked to higher and more consistent scores.

  2. Variables Used for Various Geometrical Aspects

    X-axis Variables: Represents the math scores for students named as values.

    Y-axis Variables: The variable names represents the different family income categories. These categories include LessThan20K, Between20_40K, Between40_60K, Between60_80K, Between80_100K, and MoreThan100K. These were created by mutating the original columns from FamilyIncomeLessThan20KMath, FamilyIncomeBetween20_40KMath, FamilyIncomeBetween40_60KMath, FamilyIncomeBetween60_80KMath, FamilyIncomeBetween80_100KMath and FamilyIncomeMoreThan100KMath.

    Fill: The names variables is used for color-coded representation of each income group, such as LessThan20K, Between20_40K, Between40_60K, Between60_80K, Between80_100K, and MoreThan100K.

  3. What activity might have been carried out to obtain the data graphed here?

    To obtain the data graphed here, a survey or study likely collected information on students’ family incomes after their tests on various subjects, focusing solely on math performance to analyze the relationship between income levels and academic achievement.

  4. Hypothesis/Research Question

    Is there a relationship between family income and students’ performance in mathematics?

  5. Two-Line Story Based on the Graph

    The box plot reveals that lower-income students have a wider range in their performance levels for maths scores. Surprisingly, students from higher income families consistently achieve better median scores, suggesting that financial resources play a significant role in academic success despite the larger population in lower income brackets.

Inferences and My Journey

Throughout my analysis of the school scores dataset, I gained valuable insights into how different factors, such as family income and gender, can influence students’ academic performance. By exploring the data, I discovered that students from various income levels show different patterns in their scores across subjects like Mathematics, English, and Science. For example, students from families earning less than $20,000 often displayed a wider range of performance levels, with some achieving high scores while others struggled. In contrast, those from families earning more than $100,000 consistently scored higher, suggesting that financial stability might provide access to better educational resources. This understanding highlights the importance of considering socioeconomic backgrounds when evaluating educational outcomes and devising strategies for improving academic performance.

I practiced using functions like mutate, pivot_longer, and clean_names, which helped me reshape and standardize the data for better analysis. The mutate function allowed me to create new variables that made it easier to interpret the data, while pivot_longer helped transform the dataset into a longer format, making it suitable for visualization. Creating visualizations, such as scatter plots and box plots, allowed me to see how scores were distributed among students from different family income levels. The histogram for the “FamilyIncomeLessThan20KMath” variable, for example, revealed that most students scored above 400, indicating that, despite their low income, many were performing well. These graphs not only made the data more engaging but also revealed patterns I might have missed when looking at just the table.

The correlation plots were particularly interesting. Using GGally::ggpairs, I was able to visualize the relationships between different subjects, revealing that students who excel in Mathematics also tended to perform well in Science and English. This positive correlation shows that if students do well in one subject, they’re likely to do well in others too. This highlights the importance of teaching methods that connect different subjects and support overall student growth.

I noticed that my first research questions I had are slightly connected to the box plot created. The question looks at how family income affects average test scores in different states, which has some similarities to the boxplot showing the connection between math scores and income. This shows how socio-economic status can influence academic success. The second question explores whether there are performance differences between male and female students across various subjects, which again has its similarities and differences with Chart 1 about academic scores.

Additionally, the box plot comparing math scores with family income provided a clear visual representation of how income levels affect academic performance. The plot showed that students from higher-income families had higher median scores, while those from lower-income backgrounds displayed a wider range of scores, indicating significant differences in academic achievement. This shows how financial resources can impact educational success, suggesting that more support and resources may be needed for lower-income students to ensure them with possibly similar achievements.

Finally, I couldn’t reproduce the exact version of the boxplot graph shown on the website. In my graph, the “LessThan20K” category appears at the bottom, but on the website, it’s the second category. Additionally, the tables listing the names and values don’t match those on the website either.