This is the column description of each data column of WI_NUMBEROFJOBSDAILY and how each column is filled with data.
Columns |
Description |
Key |
CLUSTER_CODE |
This comes from the "
CLUSTER_CODE" field in the hourly table. |
Primary key |
TIME_STAMP |
This is in GMT and is always end in the day. The record is aggregated in a localized day. After the aggregation, the time_stamp is then transformed back at GMT and stored in this field. Thus, this time_stamp always in the GMT form of the localized day in midnight. |
Primary key |
GROUP_NAME |
This indicates what kind of grouping the statistics are summed against. There are five kind of grouping, "By Cluster", "By Host", "By User", "By Project" or "By Queue". |
Primary key |
GROUP_CODE |
This could come from different source and it is depends on the group_type. If the group_type is "By Host", then it comes from "HostName" field of the data file. If the group_type is "By Cluster", then it comes from "ClusterName" field of the data file. If the group_type is "By Queue", then it comes from "QueueName" field of the data file. If the group_type is "By Project", then it comes from "ProjectName" field of the data file. If the group_type is "By User", then it comes from "UserName" field of the data file. Once we get the field, we then look it up in the wi_dimensioncode table to see if we have already has a record of it in there by matching the dimension_name. If we do, then we'll get the code back, otherwise, we will insert it into the wi_dimensioncode table and generate the code. The code itself is a positive integer and each new code is equal to the maximum of the existing code+1. |
Primary key |
JOB_STATUS |
This field contains the indicator of which job status the value fields are aggregated of under the grouping. It could be one of "SUSP" (suspended job), "RUN" (running job), "WAIT" (waiting job) or "PEND" (pending job). |
Primary key |
JOB_TYPE |
This is the type of job. It could be one of "BOTH", "BATCH", "INTERACTIVE" and "PARALLEL". |
Primary key |
TOTAL_JOB_NUMBER |
This is aggregated based on different group_type. If the group_type is "By Project", then the aggregation is grouped by "ProjectName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Host", then the aggregation is grouped by "HostName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By User", then the aggregation is grouped by "UserName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Queue", then the aggregation is grouped by "QueueName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Cluster", then the aggregation is first aggregate across the interval based on "ClusterName", "JobType" and "Timediff". Then it will aggregate for the day and is grouped by "ClusterName", "Jobtype" and "Timediff". The aggregation will then calculate the sum, the minimum, the maximum and count the records for the four type of job status for the day. The sum will be put into this column for each record. |
|
MIN_JOB_NUMBER |
This is aggregated based on different group_type. If the group_type is "By Project", then the aggregation is grouped by "ProjectName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Host", then the aggregation is grouped by "HostName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By User", then the aggregation is grouped by "UserName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Queue", then the aggregation is grouped by "QueueName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Cluster", then the aggregation is first aggregate across the interval based on "ClusterName", "JobType" and "Timediff". Then it will aggregate for the day and is grouped by "ClusterName", "Jobtype" and "Timediff". The aggregation will then calculate the sum, the minimum, the maximum and count the records for the four type of job status for the day. The minimum will be put into this column for each record. |
|
MAX_JOB_NUMBER |
This is aggregated based on different group_type. If the group_type is "By Project", then the aggregation is grouped by "ProjectName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Host", then the aggregation is grouped by "HostName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By User", then the aggregation is grouped by "UserName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Queue", then the aggregation is grouped by "QueueName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Cluster", then the aggregation is first aggregate across the interval based on "ClusterName", "JobType" and "Timediff". Then it will aggregate for the day and is grouped by "ClusterName", "Jobtype" and "Timediff". The aggregation will then calculate the sum, the minimum, the maximum and count the records for the four type of job status for the day. The maximum will be put into this column for each record. |
|
COUNTER |
This is aggregated based on different group_type. If the group_type is "By Project", then the aggregation is grouped by "ProjectName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Host", then the aggregation is grouped by "HostName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By User", then the aggregation is grouped by "UserName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Queue", then the aggregation is grouped by "QueueName", "ClusterName", "Jobtype" and "Timediff". If the group_type is "By Cluster", then the aggregation is first aggregate across the interval based on "ClusterName", "JobType" and "Timediff". Then it will aggregate for the day and is grouped by "ClusterName", "Jobtype" and "Timediff". The aggregation will then calculate the sum, the minimum, the maximum and count the records for the four type of job status for the day. The count will be put into this column for each record. |
|
CORE_HOUR |
The core hour is resulted from a mapping of the original time_stamp in local time. If the time_stamp is within the range from Monday to Friday, 9am to 5pm, then the core_hour will be 1, otherwise it will be 0. |
|
INSERT_SEQ |
This is a system generated sequence number. For each new record inserted, a unique sequence number is being assigned to this column. |
|