Managing HUGE Datasets with Scala SparkSome tips and tricks for working with large datasets in scala sparkNov 3, 2021Nov 3, 2021
Published inTowards Data ScienceHow to test your Spark Scala codeLet’s write some tests for Spark Scala DataFrame transformations using Mockito and scalatestMay 31, 20211May 31, 20211
Published inTowards Data ScienceHow to test your ML models from hypothesis to productionMy personal best practices for model testing and evaluation from the first hypothesis to the A/B test in production.Mar 23, 2020Mar 23, 2020
Published inTowards Data ScienceMarkov Chain for music generationFrom this article, you will learn about the Markov Chain model and how it can be applied for the music generation.Jul 8, 2019Jul 8, 2019
Published inTowards Data ScienceGale–Shapley algorithm simply explainedFrom this article, you will learn about stable pairing or stable marriage problem. You will learn how to solve that problem using Game…May 26, 2019May 26, 2019
Atom as Markdown EditorRecently I was searching for the best way to work with Markdown in OS X. I know, that you can do it in Jupyter, but sometimes I need to…Feb 10, 2019Feb 10, 2019
Ubuntu subsystem inside Windows 10I’m a Linux person, but for quite while I was working on Window 7 system because it was approved in a company where I was working. Because…Jan 5, 2019Jan 5, 2019
Monty Hall’s paradox — solve it by simulation!Hi! Have you heard about Monty Hall’s paradox? I mean come on, it’s in every book about Bayesian statistics! Thing is even though I know a…Dec 18, 201810Dec 18, 201810
Published inCindicatorMust-have statistical tests for any Data Scientist — Weekend of a Data ScientistWeekend of a Data Scientist is a series of articles with some cool stuff I care about. Idea is to spend a weekend by learning something…Sep 16, 2018Sep 16, 2018