WO2009009192A3 - Adaptive archive data management - Google Patents

Adaptive archive data management Download PDF

Info

Publication number
WO2009009192A3
WO2009009192A3 PCT/US2008/060755 US2008060755W WO2009009192A3 WO 2009009192 A3 WO2009009192 A3 WO 2009009192A3 US 2008060755 W US2008060755 W US 2008060755W WO 2009009192 A3 WO2009009192 A3 WO 2009009192A3
Authority
WO
WIPO (PCT)
Prior art keywords
data
classification
relevant
determined
definition
Prior art date
Application number
PCT/US2008/060755
Other languages
French (fr)
Other versions
WO2009009192A2 (en
Inventor
Aloke Guha
Joan Wrabetz
Original Assignee
Aumni Data Inc
Aloke Guha
Joan Wrabetz
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Aumni Data Inc, Aloke Guha, Joan Wrabetz filed Critical Aumni Data Inc
Publication of WO2009009192A2 publication Critical patent/WO2009009192A2/en
Publication of WO2009009192A3 publication Critical patent/WO2009009192A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/907Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Abstract

In one embodiment, input is received from a user defining a classification and an analytic for the classification. Multiple classifications and analytics may be defined by a user. A definition of relevance parameters is determined that characterize the classification and a set of analytics measures associated with the analytic. The definition may be for the classification. Unstructured data and structured data are analyzed based on the definition of the relevance parameters to determine relevant data in the unstructured data and the structured data. The relevant data being data that is determined to be relevant to the classification defined by the user. An index of the terms from the relevant data is determined. The index is useable by an analytics tool to provide results for queries of the unstructured data and structured data. The query may be used within the classification such that targeted results are provided using the index and the relevant data to the classification. Thus, queries from different classifications may be performed efficiently using data determined to be relevant to the classification.
PCT/US2008/060755 2007-04-18 2008-04-18 Adaptive archive data management WO2009009192A2 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US91265207P 2007-04-18 2007-04-18
US60/912,652 2007-04-18
US1276107P 2007-12-10 2007-12-10
US61/012,761 2007-12-10

Publications (2)

Publication Number Publication Date
WO2009009192A2 WO2009009192A2 (en) 2009-01-15
WO2009009192A3 true WO2009009192A3 (en) 2009-06-04

Family

ID=39873263

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/060755 WO2009009192A2 (en) 2007-04-18 2008-04-18 Adaptive archive data management

Country Status (2)

Country Link
US (2) US7912816B2 (en)
WO (1) WO2009009192A2 (en)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7668884B2 (en) 2005-11-28 2010-02-23 Commvault Systems, Inc. Systems and methods for classifying and transferring information in a storage network
US8930496B2 (en) 2005-12-19 2015-01-06 Commvault Systems, Inc. Systems and methods of unified reconstruction in storage systems
US20200257596A1 (en) 2005-12-19 2020-08-13 Commvault Systems, Inc. Systems and methods of unified reconstruction in storage systems
DE102007011407A1 (en) * 2007-03-08 2008-09-11 Fujitsu Siemens Computers Gmbh Device for processing non-structured data and for storing associated metadata, comprises storage unit and interface for reading non-structured data, where coding unit is provided for temporarily coding of data
WO2009009192A2 (en) 2007-04-18 2009-01-15 Aumni Data, Inc. Adaptive archive data management
US8244713B2 (en) 2007-07-12 2012-08-14 International Business Machines Corporation Content management system that retrieves data from an external data source and creates one or more objects in the repository
US8296301B2 (en) 2008-01-30 2012-10-23 Commvault Systems, Inc. Systems and methods for probabilistic data classification
US7836174B2 (en) 2008-01-30 2010-11-16 Commvault Systems, Inc. Systems and methods for grid-based data scanning
US8266148B2 (en) 2008-10-07 2012-09-11 Aumni Data, Inc. Method and system for business intelligence analytics on unstructured data
US8977645B2 (en) 2009-01-16 2015-03-10 Google Inc. Accessing a search interface in a structured presentation
US20100185651A1 (en) * 2009-01-16 2010-07-22 Google Inc. Retrieving and displaying information from an unstructured electronic document collection
US8452791B2 (en) 2009-01-16 2013-05-28 Google Inc. Adding new instances to a structured presentation
US8615707B2 (en) * 2009-01-16 2013-12-24 Google Inc. Adding new attributes to a structured presentation
US8412749B2 (en) 2009-01-16 2013-04-02 Google Inc. Populating a structured presentation with new values
US20100306223A1 (en) * 2009-06-01 2010-12-02 Google Inc. Rankings in Search Results with User Corrections
US9009137B2 (en) * 2010-03-12 2015-04-14 Microsoft Technology Licensing, Llc Query model over information as a networked service
AU2011201381B1 (en) 2011-03-25 2012-02-02 Brightcove Inc. Multiple phase distributed reduction analytics performance enhancements
AU2011201380B1 (en) 2011-03-25 2012-02-02 Brightcove Inc. Analytics performance enhancements
US9373078B1 (en) 2011-04-21 2016-06-21 Anametrix, Inc. Methods and systems for predictive alerting
US9395883B1 (en) 2011-08-29 2016-07-19 Anametrix, Inc. Systems and method for integration of business analytics and business networking
US8655883B1 (en) * 2011-09-27 2014-02-18 Google Inc. Automatic detection of similar business updates by using similarity to past rejected updates
US8892523B2 (en) 2012-06-08 2014-11-18 Commvault Systems, Inc. Auto summarization of content
US20140180974A1 (en) * 2012-12-21 2014-06-26 Fair Isaac Corporation Transaction Risk Detection
US9665621B1 (en) 2013-03-14 2017-05-30 EMC IP Holding Company LLC Accelerated query execution within a storage array
US9275291B2 (en) 2013-06-17 2016-03-01 Texifter, LLC System and method of classifier ranking for incorporation into enhanced machine learning
GB2524074A (en) 2014-03-14 2015-09-16 Ibm Processing data sets in a big data repository
US9977808B2 (en) 2015-06-22 2018-05-22 Sap Se Intent based real-time analytical visualizations
US10540516B2 (en) 2016-10-13 2020-01-21 Commvault Systems, Inc. Data protection within an unsecured storage environment
US10389810B2 (en) 2016-11-02 2019-08-20 Commvault Systems, Inc. Multi-threaded scanning of distributed file systems
US10922189B2 (en) 2016-11-02 2021-02-16 Commvault Systems, Inc. Historical network data-based scanning thread generation
US10331624B2 (en) 2017-03-03 2019-06-25 Transitive Innovation, Llc Automated data classification system
US10642886B2 (en) 2018-02-14 2020-05-05 Commvault Systems, Inc. Targeted search of backup data using facial recognition
WO2020102033A2 (en) * 2018-11-12 2020-05-22 Nant Holdings Ip, Llc Curation and provision of digital content
DE102019108857A1 (en) * 2019-04-04 2020-10-08 Bundesdruckerei Gmbh Automated machine learning based on stored data
KR20220095893A (en) * 2020-12-30 2022-07-07 (주)누리플렉스 Method and apparatus for standardizing heterogeneous data
US20230044288A1 (en) * 2021-08-09 2023-02-09 Infosys Limited Computer implemented system and method of enrichment of data for digital product definition in a heterogenous environment
US11356419B1 (en) 2021-10-01 2022-06-07 Oversec, Uab System and method for retrieving aggregated information about virtual private network servers
WO2023158426A1 (en) * 2022-02-17 2023-08-24 Nokia Solutions And Networks Oy Method and network node for guided network service

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6360216B1 (en) * 1999-03-11 2002-03-19 Thomas Publishing Company Method and apparatus for interactive sourcing and specifying of products having desired attributes and/or functionalities
US6697998B1 (en) * 2000-06-12 2004-02-24 International Business Machines Corporation Automatic labeling of unlabeled text data

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6250930B1 (en) * 1997-05-30 2001-06-26 Picante Communications Corporation Multi-functional communication and aggregation platform
US6564202B1 (en) * 1999-01-26 2003-05-13 Xerox Corporation System and method for visually representing the contents of a multiple data object cluster
US6961728B2 (en) * 2000-11-28 2005-11-01 Centerboard, Inc. System and methods for highly distributed wide-area data management of a network of data sources through a database interface
US6694307B2 (en) 2001-03-07 2004-02-17 Netvention System for collecting specific information from several sources of unstructured digitized data
US7194483B1 (en) * 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
WO2003065179A2 (en) * 2002-02-01 2003-08-07 John Fairweather A system and method for mining data
FI20020414A (en) * 2002-03-04 2003-09-05 Nokia Oyj Mechanism for uncontrolled clustering
US6968338B1 (en) * 2002-08-29 2005-11-22 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration Extensible database framework for management of unstructured and semi-structured documents
US20040167908A1 (en) 2002-12-06 2004-08-26 Attensity Corporation Integration of structured data with free text for data mining
US7395256B2 (en) * 2003-06-20 2008-07-01 Agency For Science, Technology And Research Method and platform for term extraction from large collection of documents
US7389306B2 (en) * 2003-07-25 2008-06-17 Enkata Technologies, Inc. System and method for processing semi-structured business data using selected template designs
US7409393B2 (en) 2004-07-28 2008-08-05 Mybizintel Inc. Data gathering and distribution system
US7444325B2 (en) * 2005-01-14 2008-10-28 Im2, Inc. Method and system for information extraction
US7849048B2 (en) 2005-07-05 2010-12-07 Clarabridge, Inc. System and method of making unstructured data available to structured data analysis tools
US7647335B1 (en) * 2005-08-30 2010-01-12 ATA SpA - Advanced Technology Assessment Computing system and methods for distributed generation and storage of complex relational data
US7813919B2 (en) * 2005-12-20 2010-10-12 Xerox Corporation Class description generation for clustering and categorization
US7657506B2 (en) * 2006-01-03 2010-02-02 Microsoft International Holdings B.V. Methods and apparatus for automated matching and classification of data
US7630946B2 (en) * 2006-05-16 2009-12-08 Sony Corporation System for folder classification based on folder content similarity and dissimilarity
US8595245B2 (en) * 2006-07-26 2013-11-26 Xerox Corporation Reference resolution for text enrichment and normalization in mining mixed data
US7634467B2 (en) * 2006-10-31 2009-12-15 Microsoft Corporation Implicit, specialized search of business objects using unstructured text
US7853595B2 (en) * 2007-01-30 2010-12-14 The Boeing Company Method and apparatus for creating a tool for generating an index for a document
US20080189163A1 (en) * 2007-02-05 2008-08-07 Inquira, Inc. Information management system
WO2009009192A2 (en) 2007-04-18 2009-01-15 Aumni Data, Inc. Adaptive archive data management
US8140584B2 (en) * 2007-12-10 2012-03-20 Aloke Guha Adaptive data classification for data mining
US8060536B2 (en) * 2007-12-18 2011-11-15 Sap Ag Managing structured and unstructured data within electronic communications
US8266148B2 (en) * 2008-10-07 2012-09-11 Aumni Data, Inc. Method and system for business intelligence analytics on unstructured data

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6360216B1 (en) * 1999-03-11 2002-03-19 Thomas Publishing Company Method and apparatus for interactive sourcing and specifying of products having desired attributes and/or functionalities
US6697998B1 (en) * 2000-06-12 2004-02-24 International Business Machines Corporation Automatic labeling of unlabeled text data

Also Published As

Publication number Publication date
US20110231372A1 (en) 2011-09-22
US20080263029A1 (en) 2008-10-23
US8131684B2 (en) 2012-03-06
US7912816B2 (en) 2011-03-22
WO2009009192A2 (en) 2009-01-15

Similar Documents

Publication Publication Date Title
WO2009009192A3 (en) Adaptive archive data management
Chitchyan et al. Survey of aspect-oriented analysis and design approaches
WO2008088721A3 (en) Querying data and an associated ontology in a database management system
WO2014059342A3 (en) Method for adaptive conversation state management with filtering operators applied dynamically as part of a conversational interface
WO2009117835A8 (en) Search system and method for serendipitous discoveries with faceted full-text classification
CA2899854C (en) Systems and methods for indentifying documents based on citation history
WO2007089274A3 (en) An improved method and apparatus for sociological data analysis
Krzywinski et al. Points of significance: Analysis of variance and blocking.
MX2013014807A (en) Client-side modification of search results based on social network data.
WO2014022345A3 (en) Disambiguating user intent in conversational interactions
WO2009152370A3 (en) Searching using patterns of usage
WO2010141799A3 (en) Feature engineering and user behavior analysis
Lei et al. A research agenda on managerial intention to green it adoption: from norm activation perspective
WO2008052132A3 (en) Pattern-based filtering of query input
WO2009102412A3 (en) Method and system for automated search for, and retrieval and distribution of, information
GB2446072A (en) Methods and systems for generating query and result-based relevance indexes
WO2010031085A3 (en) Document length as a static relevance feature for ranking search results
WO2010016989A3 (en) Context based search arrangement for mobile devices
WO2011159516A3 (en) Semantic content searching
WO2007051067A3 (en) Classification and management of keywords across multiple campaigns
WO2008039542A3 (en) System and method of ad-hoc analysis of data
WO2012064826A3 (en) Suffix array candidate selection and index data structure
WO2008156473A3 (en) Using relevance feedback in face recognition
WO2014008281A3 (en) Query-based software system design representation
WO2008066637A3 (en) Generation of a multidimensional dataset from an associative database

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08826264

Country of ref document: EP

Kind code of ref document: A2

122 Ep: pct application non-entry in european phase

Ref document number: 08826264

Country of ref document: EP

Kind code of ref document: A2