Platform Analytics 7 Dataflow

Daily Job Slot Usage by Group datamart

Main ETL XML file

wi_jobSlotUsageByGroupDaily.xml

Output CSV files

jobslotusagebygroupdaily.csv

Record sample

DATE; WEEKDAY; COREHOUR; GROUPTYPE; GROUP; CLUSTERNAME; MAX_VALUE; MIN_VALUE; SUM_VALUE; COUNTER; MAX_UTIL; MIN_UTIL; AVG_UTIL; COUNTER_UTIL

"2005-11-03"; 5; 0.0; "By Host"; "host2972"; "lsf61"; 1.0; 1.0; 27.0; 27.0; 1; 1; 1; 27.0

"2005-11-03"; 5; 0.0; "By Host"; "host4408"; "lsf61"; 1.0; 1.0; 27.0; 27.0; 1; 1; 1; 27.0

Column description

The Daily Job Slot Usage by Group datamart gets its information from the wi_jobslotusagebygroupdaily table. Data is gathered and grouped by different group types, such as by user, host, project, queue, job group, and job name. The table records the total slot usage by that particular group type for one day. For host and queue group types, slot utilization is also calculated.

The following table briefly describes each column and its source:
Column name
Source and description
DATE
This is the date portion of the localized time_stamp column in the wi_jobslotusagebygroupdaily table, and the format is "YYYY-MM-DD".
WEEKDAY
This is the weekday number from the localized time_stamp column in the wi_jobslotusagebygroupdaily table, where "1" is Sunday, "2" is Monday, etc.
COREHOUR
The source is the core_hour in the wi_jobslotusagebygroupdaily table. The core_hour is based on the localized time_stamp. If core_hour is within the range of 9am to 5pm on Monday to Friday, this field returns 1, otherwise, it returns 0.
A value of 1 means that the data in this record is aggregated from the core_hour portion of the day (9am to 5pm, Monday to Friday).
GROUP_TYPE
Identifies whether the data is grouped by user, host, project, queue, job group, or job name.
GROUP
The source is dimension_name in the wi_dimensioncode table, as identified by group_code. The group reflects the group_type. For example, if group_type is by host, then the group is the host name; likewise, if group_type is by user, then the group is the user name.
CLUSTERNAME
The source is the wi_clustercode table, as identified by cluster_code. The cluster_code source is the wi_jobslotusagebygroupdaily table.
MAX_VALUE
This is the job slot's maximum usage per specific group-type during one day. The source is the max_slot_usage column in the wi_jobslotusagebygroupdaily table.
MIN_VALUE
This is the job slot's minimum usage per specific group-type during one day. The source is the min_slot_usage column in the wi_jobslotusagebygroupdaily table.
SUM_VALUE
This is the job slot's total usage per specific group-type during one day. The source is the total_slot_usage column in the wi_jobslotusagebygroup table.
COUNTER
This is the number of records that get consolidated for this record's slot usage calculation. The source is the counter column in the wi_jobslotusagebygroupdaily table
MAX_UTIL
This is the job slot's maximum utilization per a specific group-type during one day. The source is the max_slot_util column in the wi_jobslotusagebygroupdaily table. If the group-type is not by host or queue, then the value will be null.
The slot utilization is first calculated across the slot limit for each sample. Then, the slot utilization is calculated based on this limit for each sample. The maximum slot utilization is then taken for the hour.
MIN_UTIL
This is the job slot's minimum utilization per a specific group-type during one day. The source is the min_slot_util column in the wi_jobslotusagebygroupdaily table. If the group-type is not by host or queue, then the value will be null.
The slot utilization is first calculated across the slot limit for each sample. Then, the slot utilization is calculated based on this limit for each sample. The minimum slot utilization is then taken for the hour.
AVG_UTIL
This is the job slot's average utilization per a specific group-type during one day. The source is the avg_slot_util column in the wi_jobslotusagebygroupdaily table. If the group-type is not by host or queue, then the value will be null.
The slot utilization is first calculated across the slot limit for each sample. Then, the slot utilization is calculated based on this limit for each sample. The average slot utilization is then taken for the hour.
COUNTER_UTIL
This is the number of samples taken for a record and used for a slot utilization calculation. Since slot utilization is first summarized across the same time for the slot limit calculation, this value may be different from the counter column.