by: Helge, published: Oct 29, 2015, updated: Apr 19, 2025, in

Logs & Metrics

Splunk Accelerated Data Models – Part 2

This article is based on my Splunk .conf 2015 session and is the second in a mini-series on Splunk data model acceleration. Make sure to read part 1 first.

Under the Hood

HPAS Population

The high-performance analytics store (HPAS) is populated by scheduled searches that run every 5 minutes.

The HPAS spans a user-defined time range. Old events are purged automatically by a maintenance process that runs every 30 minutes.

Populating Searches

One auto summarizing search is added to the scheduler per data model object. These searches have a low priority and the total number of them is limited. This can be a problem if your data models have many objects and/or your Splunk instance is underpowered. If the summarizing searches cannot be run as often as they should the data model may become stale.

The following two settings in limits.conf can be used to tweak how many auto summarizing searches are allowed to run:

max_searches_perc: Percentage of system-wide concurrent searches the scheduler can run. Default: 50
auto_summary_perc: Percentage of scheduler searches to be used for auto summarization. Default: 50

With the default values only 25% of all concurrent searches are available for data model acceleration!

Checking Acceleration Status

A data model’s acceleration status can easily be checked from the UI by navigating to Settings > Data Models and clicking the little arrow next to a data model’s name:

Splunk - Checking the data model acceleration status

To check the acceleration status from a search use the following:

| tstats summariesonly=t min(_time) as min,
         max(_time) as max count from datamodel=uberAgent
| eval "Start time"=strftime(min, "%c")
| eval "End time"=strftime(max, "%c")
| eval "Event count"=count
| fields "Start time" "End time" "Event count"

This search will return the dates of the earliest and latest events in the high-performance analytics store (HPAS). Note that by adding the parameter summariesonly=t we only search the HPAS.

Data Models and Apps

Data Model Definition

A data model definition is stored in a file called modelname.json in the directory $SPLUNK_HOME\etc\apps\appname\default\data\models.

The data model definition resides on the search heads and is sent to the indexers as part of the replication bundle.

Important: If the data model definition exists on multiple independent search heads, multiple copies of the HPAS are created! This unecessarily increases indexer CPU load and disk size requirements. This warning applies to independent search heads only, not to search heads in a cluster.

Enabling Acceleration

To enable acceleration for an app’s data model add the following to the app’s datamodels.conf:

[uberAgent]
acceleration = 1
acceleration.earliest_time = -1w

About the Author

Helge Klein (ex CTP, MVP, and vExpert) worked as a consultant and developer before founding vast limits, the uberAgent company, which was acquired by the Citrix business unit of Cloud Software Group in late 2023. Previously, Helge applied his extensive knowledge in IT infrastructure projects and architected a user profile management product, the successor of which is now available as Citrix Profile Management. Helge is the author of the popular tools Delprof2 and SetACL. He has presented at Citrix Synergy, BriForum, E2EVC, Splunk .conf, and many other events.

3 Comments

daniel sofoulis

August 30, 2018 at 07:18

I have 2 independent search heads that require searching across accelerated datamodels.
How can I have both of the independent search heads able to search accelerated datamodels without having multiple copies of the HPAS created?

Helge Klein

September 2, 2018 at 14:48

Unfortunately, that is not possible. One instance of the high-performance analytics store (HPAS) is generated per search head (cluster). Related Splunk Answers post: https://answers.splunk.com/answers/544456/is-there-a-way-to-share-a-data-model-across-2-sear.html

Reply

bob

July 8, 2019 at 04:42

I just want to speed up my search through the data model. I just need the simplest search, such as `index = Test name = jack’. However, through data model search, returning fields is slower than normal search. The statement I used is `pivot data model dataset SPLITROW name as new_name FILTER name is jack’. What should I do to speed up my search?

Splunk Accelerated Data Models – Part 2

Under the Hood

HPAS Population

Populating Searches

Checking Acceleration Status

Data Models and Apps

Data Model Definition

Enabling Acceleration

About the Author

3 Comments

Leave a Reply Cancel reply

Related Posts

Splunk Accelerated Data Models – Part 3

Splunk Accelerated Data Models – Part 1

Splunk Scripted Input Secrets

What Is Splunk and How Does It Work?

Latest Posts

Enertex KNX IP Secure Router: Initial Configuration Without 2nd Interface

How to Sync & Backup Frigate NVR Recordings to Offsite Cloud Storage

Frigate: NVR With Object Detection on Raspberry Pi 5 & Coral TPU

Elasticsearch ES|QL: Energy Consumption Chart With Home Assistant Data