Ideas

Discovery

Showing 26

Calculate inter- and intra-cluster record uniqueness to measure the correctness of DME output

Spark 2.3.0 and later provide an API that can calculate a clustering prediction score. It would be nice to integrate this into our DME plugin as an out-of-the-box confidence scoring model. More details about the Spark API: https://...
Jitul Nath over 1 year ago in Data Quality 0 On the Backlog

Ability to parse multi-level complex JSON files in ZDP

The current JSON SerDe is unable to parse multi-level JSON. Can we add the OpenX SerDe, which is available as open source? Details: source at https://github.com/rcongiu/Hive-JSON-Serde and JAR files at http://www.congiu.net/hive-json-serde/1.3.8/cdh5/
Adi Bandaru almost 2 years ago in Data Ingestion / Catalog 0 On the Backlog

Provide ability to comment and crowdsource business information on ZDP entities/fields.

Business users and data stewards would like to collaborate and share their comments and findings on the data sets. They would like to review these comments before the underlying data can be provisioned to downstream systems. Provide ability to capture user fe...
Adi Bandaru almost 2 years ago in Metadata / Global Search / Catalog / Discovery / Provision 0 On the Backlog

Display ingestion history for db wizard, db import created entities in entity view ingestion history tab when Display is 'Ingested File Size Per Day'

Display ingestion history for db wizard, db import created entities in entity view ingestion history tab when Display is 'Ingested File Size Per Day'. Current behavior: The 'Ingested File Size Per Day' is shown only for entities associated with file ...
Sanjay Yadav over 1 year ago in Data Ingestion / Catalog / Metadata 0 On the Backlog

Ability to auto-tag entities based on the vocabulary/business glossary

Adding labels to an entity is a tedious and time-consuming effort. Provide the capability to auto-tag based on the business glossary or by referring to the business vocabulary. Some of the competitors are leveraging a modified Maui - Multi-purpose automatic to...
Adi Bandaru almost 2 years ago in Metadata / Catalog / Profiling / Data Classification 0 On the Backlog

Enable data profiling on Hive Views from profile of underlying Tables

Enable data profiling on views in Zaloni by linking to the underlying table instead of physically creating a separate profile. From customer: Views should not have their own profiling but profile information should come from the original table profiled from where t...
Sanjay Yadav about 2 years ago in Profiling 1 On the Backlog

Ingest History - File Row Count

The customer would like to have details on row counts ingested from files. Today, ZDP 5.0.2 displays the File Size per Day and File Count per Day. The user would like to validate that these match in both a visual and cumulative report. Suggestion: Prov...
Deleted User over 2 years ago in Data Ingestion 0 On the Backlog

Ability to ingest data from MS Excel (xls, xlsx)

Ingest directly from Excel (to save a step in converting to CSV). This is particularly useful for business users within the UI, who may upload xls/x files from their computers for ingestion.
Guest over 2 years ago in Data Ingestion 0 On the Backlog

Incremental profiling of data

Incremental profiling of data. When an incremental ingestion (adding data to an existing entity) happens, can we provide a profile of the entire entity by only profiling the new data? For this customer, we will also need to make the incremental profi...
Sanjay Yadav over 2 years ago in Profiling 1 On the Backlog
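
One way to picture the incremental profiling request above: if each profile stores only mergeable summaries (counts, sums, min, max), then the profile of the whole entity can be derived from the stored profile plus a profile of the newly ingested rows, without rescanning old data. The sketch below is illustrative only. `ColumnProfile`, `profile_batch`, and `merge_profiles` are hypothetical names, not ZDP APIs, and it assumes a single numeric column.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class ColumnProfile:
    """Mergeable summary of one numeric column (hypothetical structure)."""
    count: int = 0                   # non-null values seen
    nulls: int = 0                   # null values seen
    total: float = 0.0               # sum of non-null values
    minimum: Optional[float] = None
    maximum: Optional[float] = None

    @property
    def mean(self) -> Optional[float]:
        # The mean is derived from count and total, so it never needs merging.
        return self.total / self.count if self.count else None


def profile_batch(values: Iterable[Optional[float]]) -> ColumnProfile:
    """Profile one batch of values, e.g. only the newly ingested rows."""
    p = ColumnProfile()
    for v in values:
        if v is None:
            p.nulls += 1
            continue
        p.count += 1
        p.total += v
        p.minimum = v if p.minimum is None else min(p.minimum, v)
        p.maximum = v if p.maximum is None else max(p.maximum, v)
    return p


def merge_profiles(a: ColumnProfile, b: ColumnProfile) -> ColumnProfile:
    """Combine two profiles into the profile of the union of their rows."""
    mins = [m for m in (a.minimum, b.minimum) if m is not None]
    maxs = [m for m in (a.maximum, b.maximum) if m is not None]
    return ColumnProfile(
        count=a.count + b.count,
        nulls=a.nulls + b.nulls,
        total=a.total + b.total,
        minimum=min(mins) if mins else None,
        maximum=max(maxs) if maxs else None,
    )


existing = profile_batch([1.0, 2.0, None])    # profile stored from earlier ingestion
increment = profile_batch([3.0, 5.0])         # profile of the new rows only
merged = merge_profiles(existing, increment)  # profile of the entire entity
```

This only works for statistics that combine exactly. Measures such as distinct counts or medians cannot be merged this way and would need approximate sketches or a full rescan.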

Data profiling support for ZDP Entity with hive table on top of HBase table

We created a ZDP Entity on top of an HBase table (by specifying the storage handler as HBase). The data preview on this entity is working as expected. We tried to execute profiling on the same entity. Though the workflows succeeded, the profiling d...
Mithulesh Kumar Medhi about 1 year ago in Profiling / Metadata 0 On the Backlog