Erin Jonaitis
Applied statistician at UW-Madison, interested in longitudinal data analysis, reproducible workflows, and Alzheimer's disease. Views mine; all my errors are independent.
Posts
I'm analyzing Medicare data -- my first real experience with a large dataset, where the number of observations of interest to me is in the millions. We have repeated measures/clusters to worry about, each ranging from 2 to 10 observations, give or take.
I'm struggling with performance issues in pretty much every approach I take to this dataset. One outcome of interest is a proportion. The zoib package (MCMC-based zero/one-inflated beta regression) is painfully slow, even when I take a (stratified) random sample of 2% of rows -- in an hour it's only 4% done fitting my null model. Boundary values (exact 0s and 1s) are common in the data, which rules out "transform and just use lmer."
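For concreteness, here's the shape of the model I'm after, as a minimal sketch. It uses glmmTMB's ordbeta family (ordered beta regression) as one possible maximum-likelihood alternative that tolerates exact 0s and 1s; the data are simulated and all variable names are hypothetical stand-ins for the real Medicare fields:

```r
## Minimal sketch: a mixed-effects ordered beta regression fit by maximum
## likelihood with glmmTMB (requires glmmTMB >= 1.1.5 for ordbeta()).
## ML fitting is typically far faster than zoib's MCMC, and the ordered
## beta family puts probability mass on exact 0s and 1s directly.
library(glmmTMB)

set.seed(1)
n_clusters <- 500
obs_per    <- sample(2:10, n_clusters, replace = TRUE)  # 2-10 obs per cluster
d <- data.frame(
  id = rep(seq_len(n_clusters), times = obs_per),  # cluster identifier
  x  = rnorm(sum(obs_per))                         # hypothetical covariate
)

## Proportion outcome with boundary values, as in the real data
d$y <- plogis(0.5 * d$x + rnorm(nrow(d), sd = 0.5))
d$y[sample(nrow(d), 50)] <- 0
d$y[sample(nrow(d), 50)] <- 1

## Random intercept per cluster handles the repeated measures
fit <- glmmTMB(y ~ x + (1 | id), family = ordbeta(), data = d)
summary(fit)
```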
What general tools are available for modeling bigger datasets in R? Because of data privacy agreements I'm required to do all of the computing on-prem, so unfortunately I don't think I can take advantage of high-throughput computing on other servers, even if that were workable here.
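In case it matters, here's a sketch of how a 2% stratified sample can be drawn quickly in R using data.table; `stratum` is a hypothetical stand-in for the real stratification variable:

```r
## Sketch: stratified 2% row sample with data.table (fast on large tables).
library(data.table)

dt <- data.table(
  stratum = sample(LETTERS[1:5], 1e6, replace = TRUE),  # hypothetical strata
  y       = runif(1e6)                                  # hypothetical outcome
)

## Within each stratum, keep a 2% simple random sample of rows
samp <- dt[, .SD[sample(.N, ceiling(0.02 * .N))], by = stratum]
```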
Another GitHub migration question, this time for the scholcomm people and librarians:
I've got a repository associated with a publication. I'd like to do my due diligence to keep it out of large AI models as much as I can, while also keeping it available to any researchers who care to poke more deeply into what we did. (Yes, I recognize that these are in some sense incompatible goods.) Has anyone developed a standard workflow for removing a repository from GitHub while leaving a forwarding address?