Viewing the Category Importance information in Beam
See which categories have the most impact
Beam category importance uses advanced machine learning models to identify the event categories that have the greatest impact on your demand. This is done automatically after you create an analysis and upload demand data for an individual location. The result is a list of categories, each with a significance rating. These results apply only to the individual analysis, and therefore to the location you selected for Beam.
For example, restaurants tend to be highly impacted by sports and public holidays, while hotels tend to be more impacted by conferences and concerts. By running category importance, you can see how your business locations are impacted by events.
Beam category importance covers the attended and non-attended categories shown, plus severe weather in the unscheduled category. It does not cover categories that are not shown (such as disaster events).
Below is an example:
In this example, we can see a very high impact from performing arts, sports, expos, school holidays, academic, and public holidays. The other categories do not impact this location and are shown as "None". If a category shows "None" then it does not have a statistically significant impact on demand based on the data uploaded or there are no events from this category taking place within the selected radius around the location.
You can use Beam to find out how different types of events are impacting your demand. If you are not sure what types of events impact your demand then create a Beam analysis with your demand data and look at the results.
Many customers use the Beam results to help decide what event-based machine learning features to add to their machine learning models.
Beam uses the Get Feature Importance API call to return sets of features related to important categories based on the category importance results for an analysis. Anything that you can do in the Control Center UI can also be done via the Beam API. Use the Beam API if you want to create a large number of analyses.
Advanced users can toggle the view to show p-values. This is intended for data science teams interested in digging deeper into the results. P-values represent statistical significance. Category importance uses a significance level of 0.1, so p-values less than 0.1 indicate important categories with varying degrees of impact. Categories with p-values of 0.1 or greater are not considered important.
Note: very small p-values will be shown as <0.0000. This can refer to results that are smaller than 0.0001, e.g. 0.00005.
The mapping of p-values to the readable impacts shown is below:
p ≥ 0.1 - None
0.075 ≤ p < 0.1 - Moderate
0.05 ≤ p < 0.075 - High
p < 0.05 - Very high
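If you export p-values and want to reproduce the readable impact labels in your own tooling, the mapping can be sketched as follows. This is a minimal illustration, not Beam's implementation: the function name impact_label and the sample p-values are hypothetical, and the sketch assumes the boundary between Moderate and High is 0.075, which is the only value that makes the ranges contiguous.

```python
def impact_label(p_value: float) -> str:
    """Map a category's p-value to a readable impact label.

    Assumed thresholds: 0.1 (significance level), 0.075 and 0.05.
    """
    if not 0.0 <= p_value <= 1.0:
        raise ValueError(f"p-value must be between 0 and 1, got {p_value}")
    if p_value >= 0.1:
        return "None"        # not statistically significant
    if p_value >= 0.075:
        return "Moderate"
    if p_value >= 0.05:
        return "High"
    return "Very high"


# Hypothetical p-values for a single analysis (illustrative only).
sample_p_values = {
    "sports": 0.003,
    "concerts": 0.06,
    "conferences": 0.08,
    "festivals": 0.42,
}

labels = {category: impact_label(p) for category, p in sample_p_values.items()}

# Categories below the 0.1 significance level are the important ones.
important = [category for category, p in sample_p_values.items() if p < 0.1]

for category, label in labels.items():
    print(f"{category}: {label}")
```

Each range is closed at its lower bound, so a p-value of exactly 0.1 maps to None and exactly 0.05 maps to High, matching the mapping above.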