Why are my Analytics stats inflated? - Vanilla Success
<main> <article class="userContent"> <p>If you are using Advanced Analytics, Vanilla collects traffic statistics for your site using a service called Keen.io. The rules for what Vanilla Analytics counts are very simple:</p><ul><li>Any page request that responds with an HTTP 200 status code is counted.</li><li>The initiator of the page load can be any user or guest.</li><li>The initiator can even be a bot.</li></ul><p>You might be comparing your Vanilla Analytics statistics with, for example, Google Analytics and finding some discrepancies between the two. Google Analytics can be configured in multiple ways to return different data. We have no control over that. Most likely, Google filters out crawlers, most notably its own.</p><p>As stated, Vanilla's Analytics tool does not filter robots. It is easy to underestimate how much traffic comes from crawlers like Googlebot. Here is a typical picture of a 15-day period for a site, where we query all the traffic in our access logs excluding web crawlers:</p><div class="embedExternal embedImage"> <div class="embedExternal-content"> <a class="embedImage-link" href="https://us.v-cdn.net/6030677/uploads/8GQSGHVCVA93/screen-shot-2020-06-01-at-4-37-06-pm.png" rel="nofollow noreferrer noopener ugc" target="_blank"> <img class="embedImage-img" src="https://us.v-cdn.net/6030677/uploads/8GQSGHVCVA93/screen-shot-2020-06-01-at-4-37-06-pm.png" alt="Screen Shot 2020-06-01 at 4.37.06 PM.png" /> </a> </div> </div> <p>And here is the same period but showing <strong>ONLY</strong> crawlers:</p><div class="embedExternal embedImage"> <div class="embedExternal-content"> <a class="embedImage-link" href="https://us.v-cdn.net/6030677/uploads/XAKUXGFD7J1G/screen-shot-2020-06-01-at-4-38-34-pm.png" rel="nofollow noreferrer noopener ugc" target="_blank"> <img class="embedImage-img" src="https://us.v-cdn.net/6030677/uploads/XAKUXGFD7J1G/screen-shot-2020-06-01-at-4-38-34-pm.png" alt="Screen Shot 2020-06-01 at 4.38.34 PM.png" /> </a> </div> </div> <p>Together that
makes almost one million hits, but almost a <strong>quarter million</strong> of them come from bots. The majority of those bots come from Google and so would be excluded from Google Analytics.</p><p>In this example you can see that crawler traffic is fairly consistent. It is not always like that. There are often periods of intense crawler activity followed by relatively little crawler traffic.</p><h2 data-id="google-analytics">Google Analytics</h2><p>There are other factors that can skew Google Analytics statistics. From the Wikipedia article on Google Analytics: </p><div class="blockquote"><div class="blockquote-content"><p class="blockquote-line">[There are] many <a href="https://en.wikipedia.org/wiki/Ad_filtering" rel="nofollow noreferrer ugc">ad filtering</a> programs and extensions such as <a href="https://en.wikipedia.org/wiki/Firefox" rel="nofollow noreferrer ugc">Firefox</a>'s Enhanced Tracking Protection,<a href="https://en.wikipedia.org/wiki/Google_Analytics#cite_note-32" rel="nofollow noreferrer ugc">[32]</a> the browser extension <a href="https://en.wikipedia.org/wiki/NoScript" rel="nofollow noreferrer ugc">NoScript</a> and the mobile phone app <a href="https://en.wikipedia.org/wiki/Disconnect_Mobile" rel="nofollow noreferrer ugc">Disconnect Mobile</a> can block the Google Analytics Tracking Code. This prevents some traffic and users from being tracked and leads to holes in the collected data.</p></div></div><p>Users with privacy concerns can delete or block tracking cookies, which affects Google's ability to collect accurate data.</p><h2 data-id="using-vanilla-analytics-api">Using Vanilla Analytics API</h2><p>You can get more granular data from Vanilla Analytics through our API.
Send a POST request to see which browser traffic is contributing to your statistics. The example below counts the page views recorded for Googlebot over a two-week period.</p><div class="embedExternal embedImage"> <div class="embedExternal-content"> <a class="embedImage-link" href="https://us.v-cdn.net/6030677/uploads/DCSKVBG1RKME/screen-shot-2020-06-01-at-5-22-30-pm.png" rel="nofollow noreferrer noopener ugc" target="_blank"> <img class="embedImage-img" src="https://us.v-cdn.net/6030677/uploads/DCSKVBG1RKME/screen-shot-2020-06-01-at-5-22-30-pm.png" alt="Screen Shot 2020-06-01 at 5.22.30 PM.png" /> </a> </div> </div> <pre class="code codeBlock" spellcheck="false" tabindex="0">curl --location --request POST 'https://forum.yoursite.com/api/v2/analytics/query' \
  --header 'Authorization: Bearer your-access-token' \
  --header 'Content-Type: text/plain' \
  --data-raw '{
    "collection": "page",
    "end": "2020-05-15T12:00:00.000Z",
    "filters": [
      {
        "prop": "userAgentParsed.browser.family",
        "op": "eq",
        "val": "Googlebot"
      }
    ],
    "start": "2020-05-01T12:00:00.000Z",
    "type": "count"
  }'
</pre> </article> </main>
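<p>To compare several crawlers over the same period, the request above can be wrapped in a small script. The following is a minimal sketch, not an official Vanilla tool: the helper names <code>build_query</code> and <code>count_family</code> and the environment variables <code>VANILLA_URL</code> and <code>VANILLA_TOKEN</code> are hypothetical, and any browser family value other than <code>Googlebot</code> is an assumption you should check against your own data. The request fields and headers are copied from the curl example above.</p>

```shell
#!/bin/sh

# Hypothetical helper: print the JSON body for a "count" query on the "page"
# collection, filtered to a single browser family.
# Arguments: family, start timestamp, end timestamp.
# The values are inserted as-is, so they must not contain double quotes.
build_query() {
  family=$1
  start=$2
  end=$3
  printf '{"collection":"page","type":"count","start":"%s","end":"%s","filters":[{"prop":"userAgentParsed.browser.family","op":"eq","val":"%s"}]}\n' \
    "$start" "$end" "$family"
}

# Hypothetical helper: POST one query to the analytics endpoint.
# Assumes VANILLA_URL (for example https://forum.yoursite.com) and
# VANILLA_TOKEN (your access token) are set in the environment.
count_family() {
  curl --silent --location --request POST "$VANILLA_URL/api/v2/analytics/query" \
    --header "Authorization: Bearer $VANILLA_TOKEN" \
    --header 'Content-Type: text/plain' \
    --data-raw "$(build_query "$1" "$2" "$3")"
}
```

<p>With both variables set, <code>count_family Googlebot 2020-05-01T12:00:00.000Z 2020-05-15T12:00:00.000Z</code> sends the same query as the example above. Calling it once per family lets you see how much of the total each crawler contributes.</p>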