Tumblelog by Soup.io
Newer posts are loading.
You are at the newest post.
Click here to check if anything new just came in.

November 04 2014

The basic set of data we're collecting in-house

As mentioned in another post, we're now making use of Piwik for in-depth user and blog related data. Currently, the implementation mainly just collects interesting data, and we're still working on a good way to segment it (i.e. write a Piwik plugin).

Anyhow, here it goes:

Visitor related

  • Login status of the user - all following flags apply to a logged in visitor (= Soup user) only
  • Blog privacy - the privacy status the visitor configured for their blog
  • NSFW toggle - this pertains to an upcoming release that let's the visitor toggle if they want to see NSFW material in /everyone, /friends etc.
  • Exports - tells us which exports the visitor has configured (currently this can only be facebook)
  • Reported someone - did the visitor report posts for anything, like spam. This may pertain to the visitors engagement level.
  • Email - did the visitor supply an email with their registration?
  • Which imports did the visitor configure?
  • Did the visitor connect their account to facebook, either via export or signup?
  • How long has the visitor had his account with Soup, in days
  • Which pool does the visitor belong to? Currently there is only A, which are all members of @testkitchen, and B, which is the default for everyone. We may use this to do split-testing in the future.
  • Is the visitor using an adblocker?
  • How many feeds is the visitor importing to their blog?
  • How many original (non-imported) posts does the visitor have on their blog?
  • Days since the last original post of the visitor
  • Number of groups the visitor is member of
So why are we collecting this data? Because we don't know. We have no idea what's relevant. Are users with an email address more likely to come back? Or facebook users? Do we convert people coming from facebook? Does a NSFW feature make people use Soup more at work? (chichichichi...) And in general, what weird patterns that should be investigated can be seen?

FIXME: page attributes description

You can also have an up-to-date look into our internal analytics spreadsheet slash scratchpad where we keep track of what we're keeping track of. This is what we ourselves are referring to when building new queries or reports, so it's pretty definitive.

You can use it to decipher the javascript section in every page that looks like this:

_paq.push(['setCustomVariable', 2, 'rd', '2674', 'visit']); _paq.push(['setCustomVariable', 3, 'fr', '170', 'visit']); _paq.push(['setCustomVariable', 4, 'fe', '6', 'visit']); _paq.push(['setCustomVariable', 5, 'p', '6767', 'visit']); _paq.push(['setCustomVariable', 6, 'o', '0', 'visit']); _paq.push(['setCustomVariable', 7, 'fo', '215', 'visit']); _paq.push(['setCustomVariable', 8, 'g', '33', 'visit']); _paq.push(['setCustomVariable', 1, "u", "-l-B-i-w-exfb-x-fb-e-ob-om-p-898-8af-8b1-119-bf1-249-744-fl-" + window.SOUP_test_ab, "visit"]); _paq.push(['setCustomVariable', 1, "v", "-o-ga-mp-", "page"]); _paq.push(['setCustomVariable', 2, 'rd', '4', 'page']); _paq.push(['setCustomVariable', 3, 'fr', '0', 'page']); _paq.push(['setCustomVariable', 4, 'fe', '0', 'page']); _paq.push(['setCustomVariable', 5, 'p', '0', 'page']); _paq.push(['setCustomVariable', 6, 'o', '-1', 'page']); _paq.push(['setCustomVariable', 7, 'm', '1', 'page']); _paq.push(['setCustomVariable', 8, 'fo', '0', 'page']); _paq.push(['setCustomVariable', 9, 'g', '0', 'page']);

That is what a visit to whatweknow.soup.io generates for me, which translates to:

"Visitor has been registered on Soup for 2674 days, has (follows) 170 friends, has 215 followers, imports six feeds, created 6767 original posts (including deleted ones), 0 days have gone by without creating original content, and he is member of 33 groups. Additionally, we know that he is logged in, in pool B, has configured imports, some of which work, exports posts to facebook, exports posts, has a facebook user, supplied an email address, has created bookmarklet posts, has created mobile posts, has created original posts, has a few (specific, I'm not gonna look them up now) imports configured, and reported other users for something. Also, let us know if the browser uses an ad blocker."

"The page he is visiting is his own, it's a group and he has group admin privileges, and the group moderation policy is public (crap, I need to change that asap). The group has existed for four days, has zero friends, zero feeds, zero posts (I have written a few since I opened the page, also those statistics only get regenerated every 24 hours), -1 days since the last original post (=never made an original post), the group has one member, no followers and the owner is not member of any other group."

Reposted bydanielbohrer danielbohrer

Don't be the product, buy the product!