DSS Backend Configuration Manual
Backend:
Overview
DSS has two sides to it. One being the process in which the Data is pooled to ElasticSearch and the other being the way it is fetched, aggregated, computed, transformed and sent across.
As this revolves around a variety of Data Set, there is a need for making this configurable. So that, tomorrow, given a new scenario is introduced, then it is just a configuration away from getting the newly introduced scenario involved in this flow of process.
This document explains the steps on how to define the configurations for both sides of DSS. Analytics and Ingest Pipeline Services.
Abbreviations
Ingest : Micro Service which runs as a pipeline and manages to validate, transform and enrich the incoming data and pushes the same to ElasticSearch Index
Analytics : Micro Service which is responsible for building, fetching, aggregating and computing the Data on ElasticSearch to a consumable Data Response. Which shall be later used for visualizations and graphical representations.
JOLT : JSON to JSON transformation library written in Java where the "specification" for the transform is itself a JSON document
Modules / Domain Level : These are the Services in this context. Each of the services, such as Property Tax, Trade License, Water and Sewerages are considered as Modules / Domains
Chart : Each individual graphical representation is considered as Chart in specific. For example, a Metric of Total Collection is considered as a Chart.
Visualization : Group of different Charts is considered as a Visualization. For example, the group of Total Collection, Target Collection and Target Achieved, is considered as a Metric Collection of Charts and thus it becomes a Visualization.
Ingest Pipeline Configurations
Below are list of configurations
Topic Context Configurations
Validator Schema
JOLT Transformation Schema
Enrichment Domain Configuration
JOLT Domain Transformation Schema
Descriptions :-
Topic Context Configurations
Topic Context Configuration is an outline to define which data is received on which Kafka Topic.
Indexer Service and many other services are sending out data on different Kafka Topics. If the Ingest Service is asked to receive those data and pass it through the pipeline, the context and the version of the data being received has to be set. This configuration is used to identify as in which kafka topic consumed the data and what is the mapping for that.
Figure 1.
Click here for Full Configuration
Parameter Name | Description |
topic | Holds the name of the Kafka Topic on which the data is being received |
dataContext | Context Name which needs to be set for further actions in the pipeline |
dataContextVersion | Version of the Data Structure is set here as there might be different structured data at different point of time |
2. Validator Schema
Validator Schema is a configuration Schema Library from Everit
By passing the data against this schema, it ensures whether the data abides by the rules and requirements of the schema which has been defined.
Figure 2.
Click here for an example configuration
3. JOLT Transformation Schema
JOLT is a JSON to JSON Transformation Library. In order to change the structure of the data and transform it in a generic way, JOLT has been used.
While the transformation schemas are written for each Data Context, the data is transformed against the schema to obtain a transformed data.
Follow the slide deck for JOLT Transformations
Figure 3.
Click here for an example configuration
4. Enrichment Domain Configuration
This configuration defines and directs the Enrichment Process which the data goes through.
For example, if the Data which is incoming is belonging to a Collection Module data, then the Collection Domain Config is picked. And the based on the Business Type specified in the data, the right config is picked.
In order to enhance the data of Collection, the domain index specified in the configuration is queried with the right arguments and the response data is obtained, transformed and set.
Click here for an example configuration
Paremter Name | Description |
id | Unique Identifier for the Configuration within the configuration document |
businessType | This defines as in which kind of Domain / Service is the data related to. |
indexName | Based on Business Type, Index Name is defined as to which index has to be queried to get the enhancements done from |
query | Query to execute to get the Domain Level Object is defined here. |
targetReferences sourceReference
| Fields which are variables in order to get the domain level objects are defined here. The variables and where all the values has to be picked from are documented here |
5. JOLT Domain Transformation Schema
As a part of Enhancement, once the domain level object is obtained, we might not need the complete document as is in the end data product.
Only those parameters which should be or can be used for an aggregation and representation are to be held and others are to be discarded.
In order to do that, we make use of JOLT again and write schemas to keep the required ones and discard the unwanted ones.
The above configuration is used to transform the data response in the enrichment layer.
Figure 4.
Click here for an example configuration
6. Use case:- JOLT Transformation Schema for collection V2
JOLT transformation schema for payment-v1 has taken as a use case to explain the context collection and context version v2. The payment records are processed/transformed with the schema. The schema supports splitting the billing records in a independent new record. So if there are 2 bill items in the collection/payment incoming data then this results into 2 collection records in turn.
Figure 5.
Click here for an example configuration
Here: $i, the variable value that gets incremented for the number of records of paymentDetails
$j, the variable value that gets incremented for the number of records of billDetails.
Note: For kafka connect to work, Ingest pipeline application properties or in environments direct push must be disabled.
es.push.direct=false
Analytics Configurations
Below are list of configurations
Chart API Configuration
Master Dashboard Configuration
Role Dashboard Mappings Configuration
Description :
Chart API Configuration
Each Visualization has their own properties. Each Visualization comes from different data sources (Sometimes it is a combination of different data sources)
In order to configure each visualization and their properties, we have Chart API Configuration Document.
In this, Visualization Code, which happens to be the key, will be having its properties configured as a part of configuration and are easily changeable.
Figure 6
Click here for an example configuration
Parameter Name | Description |
Key (e.g: totalApplication) | This is the Visualization Code. This key will be referred in further visualization configurations. |
chartName | The name of the Chart which has to be used as a label on the Dashboard. The name of the Chart will be a detailed name. |
queries | Some visualizations are derived from a specific data source. While some other are derived from different data sources and are combined together to get a meaningful representation. |
queries.module | The module / domain level, on which the query should be applied on. Property Tax is PT, Trade License is TL. |
queries.indexName | The name of the index upon which the query has to be executed is configured here. |
queries.aggrQuery | The aggregation query in itself is added here. Based on the Module and the Index name specified, this query is attached to the filter part of the complete search request and then executed against that index |
queries.requestQueryMap | Client Request would carry certain fields which are to be filtered. The parameters specified in the Client Request are different from the parameters in each of these indexed documents. |
queries.dateRefField | Each of these modules have separate indexes. And all of them have their own date fields. When there is a date filter applied against these visualizations, each of them has to apply it against their own date reference fields. |
chartType | As there are different types of visualizations, this field defines as what is the type of chart / visualization that this data should be used to represent. Chart types available are: metric - this represents the aggregated amount/value for records filter by the aggregate es query pie - this represents the aggregated data on grouping. This is can be used to represent any line graph, bar graph, pie chart or donuts line - this graph/chart is data representation on date histograms or date groupings perform - this chart represents groping data as performance wise. table - represents a form of plots and value with headers as grouped on and list of its key, values pairs. xtable - represents a advanced feature of table, it has addition capabilities for dynamic adding header values. |
valueType | In any case of data, the values which are sent to plot, might be a percentage, sometimes an amount and sometimes it is just a count. |
action | Some of the visualizations are not just aggregation on data source. There might be some cases where we have to do a post aggregation computation. In these kind of cases, what is the action that has to be performed on that data obtained is defined in this parameter. |
documentType | The type of document upon which the query has to be executed is defined here. |
drillChart | If there is a drill down on the visualization, then the code of the Drill Down Visualization is added here. This will be used by Client Service to manage drill downs |
aggregationPaths | All the queries will be having Aggregation names in it. |
_comment | In order to display information on the “i” symbol of each visualization, Visualization Information is maintained in this field. |
2. Master Dashboard Configuration
Master Dashboard Configuration is the main configuration which defines as which are the Dashboards which are to be painted on screen.
It includes all the Visualizations, their groups, the charts which comes within them and even their dimensions as what should be their height and width.
Figure 7.
Click here for an example configuration
Parameter Name | Description |
name | Name of the Dashboard which has to be displayed as Page Heading |
id | Unique Identifier of the Dashboard which should be used later for Querying each of these Visualizations |
isActive | Active Indicator which can be used to quickly disable a dashboard if required. |
style | Style of the Dashboard. Whether it should be a linear one or tabbed one. This information is maintained in this parameter. |
visualizations | The list of visualizations which are to be displayed in the Dashboard are listed out here. |
visualizations.row | The row identifier for each Visualization are mentioned here |
The name of an individual visualization is added here | |
visualizations.vizArray | The list of Charts within the Visualization is specified in this list. |
Group of Charts is given an ID to have a placement on the Dashboard. This unique identifier is maintained in this field. | |
Group of Charts is given a name which can be displayed on the group on Dashboard in that row. | |
visualizations.vizArray.dimensions | Each of these group of charts are given a dimension based on which they are placed in a specific row in a dashboard |
visualizations.vizArray.vizType | As there are multiple charts grouped into one visualization, the type of Visualization needs to be specified in order to indicate to the client application as what goes inside each of these visualizations and charts inside them vizType used for any other dashboards:- metric-collection, chart, performing-metric
2. performing-metric:- Used perform chart type 3. chart:- Used chart type for pie, donut, table, bar, horizantalbar, line vizType used for Home page:- collection, module
2. module: used in UI style for specific width. |
visualizations.vizArray.noUnit
visualizations.vizArray.isCollapsible visualizations.vizArray.ref
| The value types of these charts are different. Some are numbers, some are amounts, some are percentage. In the case of amounts, there is a requirement as to display in Lakhs, Crores and Units.
The value type is for card/visualisation collapsible as boolean values. This object contains url (as mandatory), logoUrl (optional), type(optional). |
visualizations.vizArray.charts | The list of individual charts inside a Visualization Group is maintained in this array list |
Individual Chart Number Identifier to indicate the uniqueness of Charts | |
Name of the Chart which can be a header label for Charts within a Visualization | |
visualizations.vizArray.charts.code | Code of the Chart which is the indicator which has to be sent to Server Side to get the data for representing the Visualization. |
visualizations.vizArray.charts.chartType | Type of Chart which has to represent the data result set which is obtained is specified here chartType:- bar, horizontalBar, line, donut, pie, metric, table |
visualizations.vizArray.charts.filters | Filters which can be applied on the Visualization and what are the fields which are filterable are mentioned here. |
visualizations.vizArray.charts.headers | In some cases, there are headers which can be a title or an additional information for the Chart Data which gets represented. |
3. Role Dashboard Mappings Configuration
Master Dashboard Configuration which was explained earlier hold the list of Dashboards which are available.
Given the instance where Role Action Mapping is not maintained in the Application Service, this configuration will act as Role - Dashboard Mapping Configuration
In this, each Role is mapped against the Dashboard which they are authorized to see
This was used earlier when the Role Action Mapping of eGov was not integrated.
Later, when the Role Action Mapping started controlling the Dashboards to be seen on the client side, this configuration is just used to enable the Dashboards for viewing.
Figure 8.
Click here for an example configuration
Parameter Name | Description |
roles | List of Roles which are available in the system |
roles._comment | Role Description and a comment on why does this role has an entry in this configuration and sums up the summary as what are the things which are to be enabled. |
roles.roleId | Unique Identifier of the Role for which Access is being given |
roles.roleName | Name of the Role for which the access is being given |
roles.isSuper | Boolean flag which defines as whether the Role is a Super User or not |
roles.orgId | Organization to which the Role belongs to |
roles.dashboards | List of Dashboards that are enabled for the Role |
Name of the individual Dashboard which has been enabled | |
Identifier of the individual Dashboard which has been enabled |
Adding Configurations
Adding Roles and Dashboards :
To add new role, RoleDashboardMappingsConf.json (roles node) configuration file has to be modified as below
Note: Any number of roles & dashboards can be added
Below as in Figure 9. is a sample to add new role object, new dashboard object
Figure 9.
To add new dashboard, MasterDashboardConfig.json (dashboards node) has to be modified as below in Figure 10.
Note: dashboards array add a new dashboard as given below
Figure 10.
Adding Visualizations in existing Dashboard
To add new visualisations, again MasterDashboardConfig.json (vizArray node) has to be modified as below as shown in Figure 11.
Note: vizArray is to hold multiple visualizations
Figure 11.
Adding charts for visualizations :
To add new chart, chartApiConf.json has to be modified as shown below. A new chartid has to be added with chart node object.
Metric chart Sample as shown in Figure 12.
Figure 12.
Pie chart Sample as shown in Figure 13.
Figure 13.
Line chart Sample as shown in Figure 14.
Figure 14.
Table chart Sample: This chart comes with 2 kind, table and xtable.
table (as shown in Figure 15.) type allows to added aggregated fields added as available in the query keys, hence to extract the values based on the key, aggegationPaths needs to add along with their data type as in pathDataTypeMapping.
Figure 15.
xtable(as shown in Figure 16.) type allows to add multiple computed fields with the aggregated fields dynamically added.
To add multiple computed columns, computedFields [] where actionName (IComputedField<T> interface), fields [] names as in exist in query key, newField as name to appear for computation must be defined.
Figure 16.
https://github.com/egovernments/configs/blob/master/egov-dss-dashboards/dashboard-analytics/ChartApiConfig.json for the full configuration in detail.
Steps to create a charts and visualise are:
Create/Add a chart in chartApiConf.json
Add a visualization for existing dashboard in MasterDashboardConfig.json as defined above.
Or in order to create/add a new dashboard create dashboard in MasterDashboardConfig.json and create a role in RoleDashboardConfig.json
Configuration Changes for DrillThrough :
1. Example Drill through in Ward table in Property Dashboard.
wardDrillDown is the visualization code for PT Drill Down. kind is the attribute that shows the type of visualization code. Apart from two things all the attributes are common.
2. Example Drill through in ComplaintList table in PGR Dashboard.
complaintDrillDown is the visualization code for PGR Drill Down.
The above complaintDrillDown visualization code called in the drill chart parameter.