Blog posts

All posts should be bite sized chunks of information that will take less that about a minute to read. Posts are mostly focussed on R, Git, GitHub, and provide tips for better reproducibility of work, collaboration with other, and transparency of ideas to foster open science principles.

2024

Live testing regular expressions …

2 minute read

Published:

As some point in your career you’ll find yourself needing to parse large amounts of alpha numeric data and selecting sections which match certain string patterns. For example, maybe you are parsing the contents of a webpage looking for hyperlinks, or summarizing the contents of a log file, or searching for values of parameters in a file heavy with text.

Use DockerHub to store and retrieve your containers

1 minute read

Published:

So you have just built your first docker image (or podman image, but more about that in a minute). And it runs! And you are excited! So you start to tell people how cool you are, that you can run “stuff” inside a container. But then are quickly brought back down to earth when you realize that you don’t know how move this image to other platforms, like the cloud. It resides “somewhere” on your local machine, in a location that isn’t obvious.

Run your model in a container

5 minute read

Published:

Containers aren’t new, but to the scientific community (as of writing), they kinda are! So what are containers? Well, simply put, they are a running instance of an image! Oh Right … totally get it!

Stop using the library() function

1 minute read

Published:

Yes, i know, you love using the library() function in R! But it can be a real problem for several reasons:

Use markdown on GitHub

less than 1 minute read

Published:

Markdown for GitHub is the language of choice when creating issues, pull requests, using discussions, or editing your README file on GitHub.

Include DESCRIPTION file in all R projects

1 minute read

Published:

The DESCRIPTION file is a necessary staple for all R packages. Among other things (which we won’t get into, since this isn’t a post about creating R packages) it is the location for you to list all dependencies of your package. So when your package is installed the user also gets all the dependent packages installed too! Nice and tidy!

Use reproducible examples (in R) when asking for help

3 minute read

Published:

Regardless of the language you are coding in, at some point, there will come a time when you just need to ask for help. When you do, you should always include a reproducible example to help explain your problem. Failing to do so will just irritate readers and limit the number of helpful responses.

Delete your last commit from GitHub

less than 1 minute read

Published:

We’ve all done it, you’ll be happily commiting and pushing to GitHub from either the command line or, more than likely, your favorite IDE when you panic and realize that you just mistakenly pushed something to GitHub. You start sweating, and frenetically start googling how to reverse this. Well it turns out it is pretty simple.