...
To help work through the schema proposals, we will work through a use case to normalize data from the internal collectd format to the ceilometer format.
Ceilometer use case
Ceilometer provides the possibility to submit samples via the REST API to allow users to send custom samples into this service. The samples that can be sent to Telemetry are not limited to the actual existing meters. There is a possibility to provide data for any new, customer defined counter by filling out all the required fields of the POST request. If the sample corresponds to an existing meter, then the fields like meter-type
and meter name should be matched accordingly.
...
As you can see, the meter sample aren't mapped to look like the meter samples submitted from an OpenStack service where there is overlap.
So where there is overlap in meters in terms of the samples collected by collectd and published to ceilometer, we would like to describe the mapping in a generic way.
Proposal 1
Looking at a combine Schema. This schema describes the collectd internal types, and the ceilometer fields that reference them
...
Code Block | ||||
---|---|---|---|---|
| ||||
{ "$schema": "http://json-schema.org/draft-04/schema#", "title": "collectd Base Schema", "description": "A schema that defines the collectd meters and events", "definitions": { "value list": { "description": "Statistics in collectd consist of a value list", "type": "object", "properties": { "values" : { "type": "array", "anyOf" : [ { "type" : "object", "properties" : { "absolute" : { "type" : "number" } } }, { "type" : "object", "properties" : { "counter" : { "type" : "number" } } }, { "type" : "object", "properties" : { "derive" : { "type" : "number" } } }, { "type" : "object", "properties" : { "gauge" : { "type" : "number" } } } ] }, "value length" : { "description": "The number of values in the data set", "type": "number" }, "time" : { "description": "Time stamp at which the value was collected", "type": "number" }, "Interval" : { "description": "interval at which to expect a new value", "type": "number" }, "host" : { "description": "used to identify the host", "type": "string" }, "plugin" : { "description": "used to identify the plugin", "type": "string" }, "plugin instance" : { "description": "used to group a set of values together", "type": "string" }, "type" : { "description": "unit used to measure a value", "type": "string" }, "type instance " : { "description": "used to distinguish between values that have an identical type", "type": "string" }, "metadata" : { "description": "an opaque data structure that enables the passing of additional information about a value list", "type": "string" } } }, "notifications": { "description": "Notifications in collectd are generic messages", "type": "object", "properties": { "severity" : { "description": "can be one of OKAY, WARNING, and FAILURE", "type": "string" }, "time" : { "description": "Time stamp at which the event was collected", "type": "number" }, "message" : { "description": "The notification message", "type": "string" }, "host" : { "description": "used to identify the host", "type": "string" }, "plugin" : { "description": "used to identify the plugin", "type": "string" }, "plugin instance" : { "description": "used to group a set of values together", "type": "string" }, "type" : { "description": "unit used to measure a value", "type": "string" }, "type instance " : { "description": "used to distinguish between values that have an identical type", "type": "string" }, "metadata" : { "description": "an opaque data structure that enables the passing of additional information about a value list", "type": "string" } } } }, "counter_name": [ {"$ref": "#/definitions/value list/plugin"}, {"$ref": "#/definitions/value list/plugin instance"} ], "counter_unit": [ {"$ref": "#/definitions/value list/type_instance"}, {"$ref": "#/definitions/value list/type"} ], "counter_volume": { "$ref": "#/definitions/value list/values" }, "resource_id": [ {"$ref": "#/definitions/value list/host"}, {"$ref": "#/definitions/value list/plugin"}, {"$ref": "#/definitions/value list/plugin instance"} ], "timestamp": { "$ref": "#/definitions/value list/time" }, "resource_metadata": { "$ref": "#/definitions/value list/metadata" }, "source": { "type": "string" }, "user_id": { "type": "string" } } |
Issues:
Proposal 1 is too high level and doesn't get into the internals of what meters get mapped to what and what units should be associated with the meters/events - in essence it doesn't improve on what's there today for overlapping meters. However is good for new meters.
Proposal 2
Involves 2 schema:
- The final message format schema
- The mapping schema
...
Code Block | ||||
---|---|---|---|---|
| ||||
{ "$schema": "http://json-schema.org/draft-04/schema#", "title": "Collectdcollectd MappingBase Schema", "description": "A mappingschema fromthat collectddefines meters and events to *other*the collectd meters and events", "type": "object", "properties"definitions": { "resourcecollectd_idfield": { "description": "Mappings from collectd resource_id to other framework resource_id_fields", "type": "object", "itemsproperties": { "typeanyOf" : "array", { "itemsInterval" : { "collectd_resource_id "description": "stringinterval at which to expect a new value", "new_resource_id "type": "stringnumber" }, } "host" : { }, "meters": { "description": "Mappings from collectd metersused to otheridentify frameworkthe metershost", "type": "array", "type": "itemsstring": { "type": "array", }, "itemsmessage" : { "collectd_meter_stringsdescription": "stringThe notification message", "new_metertype": "string", "collectd_unit": "string"}, "new_meter_unitmetadata" : "string", { "unit_conversiondescription": "string"an opaque data structure that enables the passing of additional information about a value list", } } }, "type": "string" "events": { "description": "Mappings from collectd events to other framework events",}, "typeplugin" : "object",{ "items": { "description": "used to identify the "type": "array",plugin", "itemstype": {"string" "collectd_event": "string"}, "new_event": "string",plugin instance" : { "collectd_severitydescription": "stringused to group a set of values together", "new_severitytype": "string" } }, } }, "severity" : { "collectd_plugin_to_field_map": { "description": "Mappingscan be one of collectd_plugin values to Fields", OKAY, WARNING, and FAILURE", "type": "object", "type": "itemsstring": { "type": "object", }, "itemstime" : { "Field "description": "stringTime stamp at which the value was collected", "collectd_field": {type": "number" }, "type" : "array",{ "itemsdescription": {"unit used to measure a value", "collectd_plugin_nametype": "string", "collectd_plugin_type": "string" "collectd_plugin_type_instance": "string" }, } "type instance " : { } "description": "used to }distinguish between values that have an identical type", } } } } |
Taking the following ceilometer meter as an example:
Code Block |
---|
- name: 'vcpus'"type": "string" event_type: 'compute.instance.*' type: 'gauge' }, unit: 'vcpu' volume: $.payload.vcpus user_id: $.payload.user_id "values" project_id: $.payload.tenant_id{ resource_id: $.payload.instance_id |
...
"type": "object",
"anyOf" : {
"absolute" : { "type" : "string" } ,
"counter" : { "type" : "string" },
"derive" : { "type" : "string" },
"gauge" : { "type" : "string" },
"actual_values" : { "type" : "string" }
}
},
"value length" : {
"description": "The number of values in the data set",
"type": "number"
}
}
}
},
"mappings": {
"description": "mapping pair",
"type": "object",
"properties": {
"oneOf": {
"$ref": "#/definitions/collectd_field",
"collectd_string": { "type": "string" }
},
"new_string": { "type": "string" },
"new_value": { "type": "string" }
}
},
"mappings_with_conversion": {
"description": "Mappings from collectd timestamps to other framework timestamps",
"type": "object",
"properties": {
"transform": {
"description": "array of mappings from collectd to other frameworks",
"type": "array",
"items": {
"$ref": "#/definitions/mapping",
"conversion": "string"
}
}
},
"required": [ "mappings" ]
}
},
"field_mappings": {
"description": "array of field_mappings",
"type": "array",
"items": {
"set": {
"description": "set of associated of field_mappings",
"type": "array",
"anyOf":{
"$ref": "#/definitions/mappings",
"$ref1": "#/definitions/mappings_with_conversion"
}
}
}
}
} |
Taking the following ceilometer meter as an example:
Code Block |
---|
- name: 'compute.node.cpu.percent'
event_type: 'compute.metrics.update'
type: 'gauge'
unit: 'percent'
volume: $.payload.metrics[?(@.name='cpu.percent')].value * 100
resource_id: $.payload.host + "_" + $.payload.nodename
timestamp: $.payload.metrics[?(@.name='cpu.percent')].timestamp
metadata:
event_type: $.event_type
host: $.publisher_id
source: $.payload.metrics[?(@.name='cpu.percent')].source
|
Using the schema above:
Code Block | ||
---|---|---|
| ||
{
"field_mappings": [
"set": [
{
"new_string": "counter_name",
"collectd_field": {
"plugin": "cpu"
},
"conversion": "cpu"
},
{
"new_string": "counter_type",
"collectd_field": {
"values": {
"gauge": "cumulative"
}
}
},
{
"new_string": "counter_volume",
"collectd_field": {
"values": {
"actual_values": "values_array"
}
}
"conversion": "None"
},
{
"new_string": "timestamp",
"collectd_string": "timestamp"
},
{
"new_string": "source",
"new_value": "collectd"
},
{
"new_string": "resource_id",
"collectd_field": {
"values": {
"host": "uuid",
"plugin instance": "plugin_instance"
}
}
}
]
]
} |
Key_words: values_array.
References
http://docs.openstack.org/admin-guide/telemetry-data-collection.html
...