
Background
Why is an AI lab mucking with PROs? Machine learning needs good labels in order to work its wonders. Focusing on label quality is critical. In our lab we’ve both looked at existing outcome measures as well as investigate new ones.
PROs vs other measures
Comparing PROs against other measures isn’t new. Most of those studies show varying correlations. By leveraging the OsteoArthritis (OAI) data set, we could do this analysis at a whole new scale.
So, how do PROs reflect patient performance on standardized functional tests? After all, the goal of total knee arthroplasty is to return functionality. Turns out the correlations are very weak.

What about reflecting structural degradation of the knee? We looked at that too (waiting for publishing). Now, clinicians already know this by experience, but you can have a terrible knee but say much on your PROs.
Other Measures
This all implies that measuring successful outcomes probably needs more than PROs. We are currently researching other possibilities.
Pain
What if there was an objective way to measure how often a patient has pain? This would give researchers something more independent of the psycho-social model.
Gait
What if we could measure patient gait improvement without a full gait lab?