Lots of videos and blogs around what to study, how to prepare etc, but I wanted to share a few key points that I found useful to remember for the exam. My own exam prep details I included at the bottom in case anyone’s interested in reading my experience (I spent a total of 4... Continue Reading →
Some Useful PySpark Commands to Know When Working with Databricks
I had to figure out a bunch of things for a recent Azure Databricks hobby project I was working on. Required a whole bunch of Google searches, and quite some time, so thought I would capture some of the useful things I learned during the process:This was the overall architecture:Creating a ‘standard’ CSV file:When working... Continue Reading →
Working with multiple python installations on Windows
I have Python 2.7 and 3.5 both installed on my Windows machine and several times I wanted install some new package I’ve ended up with the “No module names ‘xxx’” message although I had just installed the package! Every time I end up resolving the issue somehow, but I keep forgetting what I did... Continue Reading →
Working with date fields in R and Power BI
I’m starting to become a big fan of Power BI and the monthly releases are certainly helping the tool get closer to prime time (I think Power BI has come a long way, but still has some catching up to do). I love the Power BI-R integration which significantly enhances the charting capabilities. I’m yet... Continue Reading →
Creating PDF Documents with R/RStudio
For something I’m currently working on I wanted to create some PDF reports so that I can share the results with others via email. Turned out creating PDFs to output the values from R dataframes is not so straightforward after all. If it’s a ggplot type graphic, easiest is probably to do something like: which... Continue Reading →
Launching Spyder on Windows
If you install Spyder using any of the standalone installers after installing Python, you’ll have trouble launching the Spyder IDE. If you go to python_dir/Scripts, you’ll see the following: but clicking on spyder.dat will launch a command prompt for a split second and then disappear. This seems to be a common issues as asked here... Continue Reading →
Using Visual Studio 2013 for Python (3.4) with NumPy and SciPy on Windows
There seem to be various editors for Python and there are many articles online (e.g., this blog post) that discuss the features of the various editors. PyCharm by JetBrains seems pretty popular, but while I was Googling for Python editors, I came across Python Tools for Visual Studio. Coming from a C# background, I thought... Continue Reading →
Charting with WPF/C#
Visualization techniques have been evolving rapidly and with a growing trend towards ‘big data’ and ‘analytics’ there are so many technologies to choose from. If the development technology is not a constraint, D3 is arguably the best way to go, but if you are looking for some charting capabilities within a WPF/C# development environment your... Continue Reading →
PPO vs HDP healthcare plans
It’s that time of the year again where most employees need to sign up for their healthcare plans for 2014 (assuming you are getting insurance through your employer). There seem to be all sorts of healthcare plans like PPO, ABHP/HSA/HDP, HMO, EPO, POS etc (here are a couple of references that discuss what all the... Continue Reading →