cghlewis.bsky.social
Freelance Research Data Management Consultant | cghlewis.com
Co-organizer R-Ladies St. Louis
Co-organizer POWER Data Management Hub | https://osf.io/ap3tk/
Author Data Management in Large-Scale Education Research | https://datamgmtinedresearch.com/
818 posts 4,873 followers 1,368 following

When planning for data collection, especially in longitudinal studies, first consider how that data will be used. Ask yourself:
- How will we combine data for analysis?
- What unique IDs will allow us to do this?
- How will we name/code items to combine data?
- Will our data need restructuring?

Seems like a good day to share this for anyone who is interested in starting a freelance career working with data. docs.google.com/presentation...

We had a great time with @rladiesamsterdam.bsky.social and R-Ladies RTP discussing "Tidy Data, Weighted Insights: Analyzing Complex Survey Data with #RStats"! Recording is here: youtu.be/ip78qoxoZGc?... Interested in more? We're presenting a workshop at useR! Register here: user2025.r-project.org

If you work with Qualtrics, and want to import a file into R without the extra rows of metadata, here is one way to do that. #rstats
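For readers outside R, the same idea can be sketched in Python with only the standard library. This is a minimal sketch, not the method from the post: it assumes the usual Qualtrics CSV layout in which row 1 holds variable names and the next two rows hold question text and import IDs. The function name `read_qualtrics_csv`, the parameter `n_metadata_rows`, and the sample data are all made up for illustration.

```python
import csv
import io


def read_qualtrics_csv(handle, n_metadata_rows=2):
    """Return (column_names, rows) from a Qualtrics-style CSV export.

    Assumes row 1 is the variable names and the following
    n_metadata_rows rows are metadata to be discarded.
    """
    reader = csv.reader(handle)
    header = next(reader)  # row 1: variable names
    for _ in range(n_metadata_rows):
        next(reader, None)  # skip question text / import ID rows
    rows = [dict(zip(header, row)) for row in reader]
    return header, rows


# Tiny made-up export, for illustration only
sample = io.StringIO(
    "ResponseId,Q1\n"
    "Response ID,How satisfied are you?\n"
    '"{""ImportId"":""_recordId""}","{""ImportId"":""QID1""}"\n'
    "R_001,4\n"
    "R_002,5\n"
)
header, rows = read_qualtrics_csv(sample)
```

All values come back as strings, so numeric columns would still need converting. If an export has a different number of metadata rows, adjust `n_metadata_rows` after inspecting the file.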

Remote data management job at Vanderbilt ecsr.fa.us2.oraclecloud.com/hcmUI/Candid...

I post a lot about how important it is to not just do data quality checks on your entire sample, but also by grouping variables (to uncover hidden errors). But @emilyriederer.bsky.social wrote a whole article about it! emilyriederer.netlify.app/post/groupin...
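A small illustration of why grouped checks matter, sketched in Python with made-up data. The field names `site` and `score` and both helper functions are hypothetical. The overall missing rate looks tolerable, while the per-group rate shows that one site is missing everything.

```python
from collections import defaultdict


def missing_rate(values):
    """Share of values that are missing (None)."""
    return sum(v is None for v in values) / len(values)


def missing_rate_by_group(records, group_key, value_key):
    """Missing rate of value_key computed separately for each group."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[group_key]].append(rec[value_key])
    return {group: missing_rate(vals) for group, vals in groups.items()}


# Made-up records: site A is complete, site B has no scores at all
records = (
    [{"site": "A", "score": 10} for _ in range(8)]
    + [{"site": "B", "score": None} for _ in range(2)]
)

overall = missing_rate([r["score"] for r in records])
by_site = missing_rate_by_group(records, "site", "score")
```

Here `overall` is 0.2, which might pass a whole-sample threshold, while `by_site` reports 0.0 for site A and 1.0 for site B.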

"If open science is done transparently, it doesn’t have to be done perfectly. You can always refine as you go." New @nature.com, Serbian scientist @alelazic.bsky.social talks about the lessons she learned developing an open-science resource outside the 'global north' 🧪 www.nature.com/articles/d41...

I'm glad exporting data to different tabs of a spreadsheet is so easy in #rstats.

Remote data management job: jobs.hazeldenbettyford.org/careers-home...

I'm Sarah. I edit, proofread, format ePubs, and do author coaching in addition to research data management consulting. See linktr.ee/sarahoelker for info on my gigs. Come on in, take a look around, and if you know someone who could use my help, please pass this on!

I love the idea of adding entity relation diagrams to READMEs. 👏

I'm gonna talk tomorrow about good dataset READMEs. Any good examples? Self-nomination welcome. Could be academic, business or fun projects. Like: (simple) gabors-data-analysis.com/datasets/hot... (paper) zenodo.org/records/4016... (@koren.mk )

New version of R is out! Our Data Scientist, Russ Hyde, has put together a quick review of the key features and changes in R 4.5, from new language features to graphics updates and more. 📝 Read the full blog post here: www.jumpingrivers.com/blog/whats-n... #rstats #Rprogramming #opensource

For anyone who didn't know about readr::parse_number(), it's pretty cool. #rstats

I receive some lab test results in a simple normal table. All results are numerical but sometimes the lab reports values in the following format: "< 0.10", "< 0.05". Any good strategy on dealing with this when working with these data in #rstats? Maybe @cghlewis.bsky.social ?
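One possible approach, sketched here in Python rather than R, is to split each raw entry into a numeric value and a comparator flag so the "below detection limit" information is kept instead of discarded. This is a hedged sketch, not what `readr::parse_number()` does internally. The function name `parse_lab_value` and the pattern are assumptions, and the pattern only covers plain decimals with an optional leading `<` or `>`.

```python
import re

# Optional comparator, optional spaces, then a plain decimal number
_LAB_VALUE = re.compile(r"^\s*([<>])?\s*(\d+(?:\.\d+)?)\s*$")


def parse_lab_value(raw):
    """Split a raw lab entry into (value, comparator).

    comparator is "<" or ">" for censored results, "" for exact results.
    Entries that do not match the pattern return (None, None).
    """
    match = _LAB_VALUE.match(raw)
    if match is None:
        return None, None
    comparator, number = match.groups()
    return float(number), comparator or ""


below_limit = parse_lab_value("< 0.10")
exact = parse_lab_value("1.25")
unparsed = parse_lab_value("n/a")
```

How to analyze the flagged values (for example substitution or a censored-data model) is a separate decision that this sketch does not make.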

I had to.

Some really great materials for learning how to manage longitudinal data from @ukdataservice.bsky.social 👇 ukdataservice.ac.uk/2025/04/08/p...

Did you know you could perform web crawling in R using the Rcrawler package?! Web crawling is about indexing and discovering URLs from a page, similar to what a search engine does. My tutorial below walks you through how to use the package: youtu.be/JIRh025E6Dk?...

About once a week someone reaches out to ask if I would be willing to meet (other freelancers wanting to connect, or someone looking to transition into freelancing) and I always say yes, b/c I feel really lucky to have been able to do this work for the last 3 years, and I'm happy to pay it forward.

Someday, maybe I'll find the time to write this up in article format. #databs cghlewis.com/talk/sssp_ecf/

I've posted this before, and will do so again until people pay attention. If you're in the US, or affiliated with the US, and you are an academic doing soon-to-be-censored work, YOU ABSOLUTELY NEED TO BE ENCRYPTING YOUR COMMUNICATION. Signal/WhatsApp is not enough. youtu.be/M2-rly3ernY

*EXECUTIVE ORDER: TALK TO A STATISTICIAN BEFORE YOU BEGIN COLLECTING DATA

What’s everyone’s go-to folder structure for their projects? #rstats

Psych-DS is (1) spellcheck for your datasets and (2) a pathway to standardizing data in our academic fields that *everyone* can learn. And it's live RIGHT NOW! psych-ds.github.io (This is the announcement post I've been leading up to)

Final_Version_of_Tarrifs_actualFINALcopy_version7_USETHISONE.docx

If you build a data collection instrument in a tool like Qualtrics or REDCap or even a paper form, consider working through a series of pilot tests to ensure you collect quality, usable data!

🌱 today i'd like to talk a bit about code reproducibility.
an analysis is reproducible if its authors (& others) can run the code for it & get the same results consistently.
reproducibility can mean:
✅ managing environments in which the code is run
✅ sharing input data
✅ writing clear documentation

#rstats do we have something that will turn a str() or dplyr::glimpse() printout into a markdown table?

Four Things Successful Teams Do Differently
We describe 4 keys to team success & why many teams fail to succeed:
1) Focus on a superordinate goal
2) Use collective language
3) Allocate resources wisely
4) Engage in constructive criticism
www.powerofusnewsletter.com/p/four-thing...