When we are importing from an Oracle database the user is facing problems in type conversion. The user is importing columns having the Number datatype in Oracle. The expected behavior that they want while importing the columns is that: Number data...
Kavel Baruah
almost 2 years ago
in Data Ingestion
0
On the Backlog
Data profiling support for ZDP Entity with hive table on top of HBase table
We created a ZDP Entity on top of an HBase table (By specifying the storage handler as HBase). The data preview on this entity is working as expected. We tried to execute profiling on the same entity. Though the workflows succeeded the profiling d...
Mithulesh Kumar Medhi
about 2 years ago
in Metadata / Profiling
0
On the Backlog
Calculate uniqueness inter and intra cluster records to measure the correctness of DME output
As with the Spark v 2.3.0 or later version, has a API which can calculate measure of Clustering Prediction Score - It will nice to integrate this in our DME plugin as a confidence scoring model out of box. More details about the Spark API : https:...
Jitul Nath
over 2 years ago
in Data Quality
0
On the Backlog
Display ingestion history for db wizard, db import created entities in entity view ingestion history tab when Display is 'Ingested File Size Per Day'
Display ingestion history for db wizard, db import created entities in entity view ingestion history tab when Display is 'Ingested File Size Per Day' Current behavior: The 'Ingested File Size Per Day' is shown only for entities associated with fil...
Provide ability to comment and crowdsource business information on ZDP entities/fields.
Business user/Data Stewards would like to collaborate and share their comments and findings on the data sets. They would like review these comments before underlying data can be provisioned to downstream systems. Provide ability to capture user fe...
Ability to auto tag entities based on the vocabulary/business glossary
Adding labels to the entity is tedious and time consuming effort. provide capability to auto tag based on the business glossary. or by referring business vocabulary. some of the competitors are leveraging modified Maui - Multi-purpose automatic to...
Clients Scenario: I have an entity with custom data format. This entity is associated with an EDQ action of a post ingestion WF. While executing the action, I got the exception "Entity with data file CUSTOM is not supported by EDQ"
Ability to compare data in trusted zone with source of truth data
As a data steward or member of governance team, I want the ability to compare data stored in trusted zone with golden record/source of truth data stored in source platform, so that I can ensure data quality and completeness.
Nikhil Goel
almost 3 years ago
in Data Quality
0
On the Backlog
Ability to parse multi level complex json file in ZDP
Current json serde unable to parse multi level json. can we add openx serde which is available as open source. details: open source link : https://github.com/rcongiu/Hive-JSON-Serde jar files link: http://www.congiu.net/hive-json-serde/1.3.8/cdh5/