To Hadoop or not

Hadoop is the hammer of big data. This slide deck covers the history and basics of Hadoop as well as some alternatives.

Advertisements

How Robots are Changing the Way we Sell

I had a talk about how AI is changing marketing and sales at Inbound 2016. The slides are available at http://content.inbound.com/content/ai-how-robots-are-changing-the-way-we-sell. Since the slides are not self-explanatory, I decided to write this companion post. “No humans should perform slave work. It is not interesting, it is tiring and the payment is low. All work than can […]

dict issues in pyspark

I was running some Spark jobs that showed odd results. The output had complex fields that showed up with null values for fields that should always have a value: { “year”: null, “name”: “John Smith”, “age”: null } This puzzled me. I tried hardcoding all those values and setting them once by setting the field to this […]

Self-Preserving Artificial Intelligence

I’m sure most people have heard of the dilemma of whether to design self-driving cars to reduce the number of deaths or to protect their driver. To those of you who haven’t, picture this; you are sitting in your cars which is driving along in a partially blind curve. The car discovers a crowd of people in […]

Named Entities

Here’s an introduction to named entities, named entity recognition (NER), and named entity disambiguation (entity linking). There is also information about how this is useful for Companybook. I originally held this presentation for a Data Science Meetup in Oslo. It’s aimed at data scientists.

Pig lovers meet TOP

Have you ever needed to get the top n items for a key in Pig? For instance the most popular three items in each country for an online store? You could always solve this the hard way by calculating a threshold per country and then filter on that threshold. This is neither to write or execute. What you […]

Crash course in Erlang

This is a summary of a talk I held Monday May 14 2012 at an XP Meetup in Trondheim. It is meant as a teaser for listeners to play with Erlang themselves. First, some basic concepts. Erlang has a form of constant called atom that is defined on first use. They are typically used as […]