Full scrape
The repository contains the full professor scrape with a 20-review cap per professor. That layer is used for coverage and concentration diagnostics.
Interactive report
This site combines a full Ivy League professor scrape with a professor-balanced transformer sentiment sample. The main conclusion is not just which school scores highest. It is that review concentration materially changes how reliable those rankings are.
School view
This ranking uses one recent review per reviewed professor. It is the cleaner comparison when the goal is to reduce dominance from a small number of highly reviewed professors.
Hover for exact values and coverage details.
Marker size scales with balanced review count.
Sorted by balanced sentiment.
| School | Sentiment | Positive | Balanced Reviews | Professors | Detail |
|---|
Bias diagnostics
The full scrape reveals how much of each school’s public profile comes from a narrow set of professors. That is the reporting-bias layer the balanced NLP sample is trying to correct for.
Higher means a small professor subset dominates the review pool.
Lower zero-review share and lower concentration is the healthier quadrant.
These figures come from the full professor scrape, not the balanced NLP subset.
| School | Full Reviews | Median Reviews / Professor | Zero-Review Professors | Top 10% Share |
|---|
Department view
Department results are sensitive to coverage. Use the school filter and minimum review threshold to focus on segments with enough data to be interpretable.
Filtered interactively by school and minimum review count.
Negative side of the same filtered population.
| School | Department | Sentiment | Reviews |
|---|
| School | Department | Sentiment | Reviews |
|---|
Downloads
These links point to the generated CSV outputs copied into the deployable site bundle.
Method
The repository contains the full professor scrape with a 20-review cap per professor. That layer is used for coverage and concentration diagnostics.
The sentiment ranking uses one review per reviewed professor. That is a methodological correction, not a convenience sample.
Sentiment scores were generated with distilbert-base-uncased-finetuned-sst-2-english and mapped onto a signed scale from -1 to +1.
This does not measure educational quality directly. It measures review text sentiment and the shape of who gets reviewed.
Static figures