Why is my analytics data inflated? - HL Vanilla Community
<main> <article class="userContent"> <p>If you're using <strong>Advanced Analytics</strong>, <strong>Higher Logic Vanilla</strong> collects traffic statistics for your community using a service called <strong>Keen.io</strong>. </p><p>The rules for Advanced Analytics are as follows: </p><ul><li>Any page that is requested that responds with an HTTP code 200.</li><li>The initiator of the page load can be any user or guest. </li><li>The initiator can even be a bot.</li></ul><p>You might be comparing your Advanced Analytics statistics with another service, such as Google Analytics, and finding some discrepancies between the two. Google Analytics can be configured in multiple ways to return different data. We have no control over that. Most likely Google filters out crawlers (likely their own).</p><p>As stated, Vanilla's Advanced Analytics tool does not filter bots. It can be easily under-estimated how much traffic comes from crawlers, like GoogleBot. As an example, here's a 15-day period of a site where we query all of the traffic in our access logs, <em>excluding web crawlers</em>:</p><div class="embedExternal embedImage display-large float-none"> <div class="embedExternal-content"> <a class="embedImage-link" href="https://us.v-cdn.net/6030677/uploads/8GQSGHVCVA93/screen-shot-2020-06-01-at-4-37-06-pm.png" rel="nofollow noreferrer noopener ugc" target="_blank"> <img class="embedImage-img" src="https://us.v-cdn.net/6030677/uploads/8GQSGHVCVA93/screen-shot-2020-06-01-at-4-37-06-pm.png" alt="Screen Shot 2020-06-01 at 4.37.06 PM.png" height="532" width="2156" loading="lazy" data-display-size="large" data-float="none"></img></a> </div> </div> <p>Here is the same period but showing <em>only web </em>crawlers:</p><div class="embedExternal embedImage display-large float-none"> <div class="embedExternal-content"> <a class="embedImage-link" href="https://us.v-cdn.net/6030677/uploads/XAKUXGFD7J1G/screen-shot-2020-06-01-at-4-38-34-pm.png" rel="nofollow noreferrer noopener ugc" target="_blank"> <img class="embedImage-img" src="https://us.v-cdn.net/6030677/uploads/XAKUXGFD7J1G/screen-shot-2020-06-01-at-4-38-34-pm.png" alt="Screen Shot 2020-06-01 at 4.38.34 PM.png" height="526" width="2162" loading="lazy" data-display-size="large" data-float="none"></img></a> </div> </div> <p>Together that makes almost one million hits, but almost a <strong>quarter million</strong> are coming from bots. The majority of those bots are coming from Google, and so these would be excluded from Google Analytics.</p><p>In this example, the crawler traffic is fairly consistent, but it's not always like that: there are often periods of intense crawler activity, followed by relatively little crawler traffic.</p><h2 data-id="google-analytics">Google Analytics</h2><p>There are other factors that can skew Google Analytics statistics. From the Wikipedia article on Google Analytics: </p><div class="blockquote"><div class="blockquote-content"><p class="blockquote-line">[There are] many <a href="https://en.wikipedia.org/wiki/Ad_filtering" rel="nofollow noreferrer ugc">ad filtering</a> programs and extensions such as <a href="https://en.wikipedia.org/wiki/Firefox" rel="nofollow noreferrer ugc">Firefox</a>'s Enhanced Tracking Protection,<a href="https://en.wikipedia.org/wiki/Google_Analytics#cite_note-32" rel="nofollow noreferrer ugc">[32]</a> the browser extension <a href="https://en.wikipedia.org/wiki/NoScript" rel="nofollow noreferrer ugc">NoScript</a> and the mobile phone app <a href="https://en.wikipedia.org/wiki/Disconnect_Mobile" rel="nofollow noreferrer ugc">Disconnect Mobile</a> [that] can block the Google Analytics Tracking Code. This prevents some traffic and users from being tracked and leads to holes in the collected data.</p></div></div><p>Users with privacy concerns can delete or block tracking cookies, which would affect Google's ability to collect accurate data.</p><h2 data-id="using-vanilla-analytics-api">Using Vanilla Analytics API</h2><p>You can get more granular data from your Vanilla analytics via our API. Create a POST request to see what browser traffic is contributing to your statistics.</p><div class="embedExternal embedImage display-large float-none"> <div class="embedExternal-content"> <a class="embedImage-link" href="https://us.v-cdn.net/6030677/uploads/DCSKVBG1RKME/screen-shot-2020-06-01-at-5-22-30-pm.png" rel="nofollow noreferrer noopener ugc" target="_blank"> <img class="embedImage-img" src="https://us.v-cdn.net/6030677/uploads/DCSKVBG1RKME/screen-shot-2020-06-01-at-5-22-30-pm.png" alt="Screen Shot 2020-06-01 at 5.22.30 PM.png" height="1182" width="1292" loading="lazy" data-display-size="large" data-float="none"></img></a> </div> </div> <pre class="code codeBlock" spellcheck="false" tabindex="0">curl --location --request POST 'https://forum.yoursite.com/api/v2/analytics/query' \ --header 'Authorization: Bearer your-access-token' \ --header 'Content-Type: text/plain' \ --data-raw '{ "collection": "page", "end": "2020-05-15T12:00:00.000Z", "filters": [ { "prop": "userAgentParsed.browser.family", "op": "eq", "val": "Googlebot" } ], "start": "2020-05-01T12:00:00.000Z", "type": "count" }' </pre> </article> </main>