About Me

Author

Derek Sollberger

Published

July 9, 2025

About Me

profile

Teaching

Princeton CSML

Current courses

  • Data Science
  • Machine Learning
  • Artificial Intelligence

UC Merced

Previous Courses

  • Bioinformatics
  • Calculus
  • Probability
  • Sports Analytics
  • Statistics

Current Projects

Longitudinal Sentiment Analysis

Hamilton, et al

Abstract Topic Modeling

BERTopic

organize topics from abstracts from

  • Journal of Data Science Education
  • Journal of Biology Education

Image credit: Ashwin Rachha

Home Economics

receipt bookkeeping
  • scan grocery receipts
  • OCR
  • AI text decoding
  • code expense report

Foreign Language Tutor

proficiency levels

LLM prompting. “I am at the A2 level in German. Help me advance to B1.”

  • organize vocabulary words
  • create lesson plans
  • make quizzes

Image credit: Smarter German

Coding School

AI: Medical Imaging

MRI scans
  • data set validation
  • clustering
  • image segmentation
  • annotation

DS: Sentiment Analysis

The Office
  • sentiment: “negative” to “positive”
  • algorithm comparison
  • data standardization
  • feature selection
sessionInfo()
R version 4.5.0 (2025-04-11 ucrt)
Platform: x86_64-w64-mingw32/x64
Running under: Windows 10 x64 (build 19045)

Matrix products: default
  LAPACK version 3.12.1

locale:
[1] LC_COLLATE=English_United States.utf8 
[2] LC_CTYPE=English_United States.utf8   
[3] LC_MONETARY=English_United States.utf8
[4] LC_NUMERIC=C                          
[5] LC_TIME=English_United States.utf8    

time zone: America/New_York
tzcode source: internal

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

loaded via a namespace (and not attached):
 [1] htmlwidgets_1.6.4 compiler_4.5.0    fastmap_1.2.0     cli_3.6.5        
 [5] tools_4.5.0       htmltools_0.5.8.1 rstudioapi_0.17.1 yaml_2.3.10      
 [9] rmarkdown_2.29    knitr_1.50        jsonlite_2.0.0    xfun_0.52        
[13] digest_0.6.37     rlang_1.1.6       evaluate_1.0.3