IN2013MU03472A - - Google Patents
Info
- Publication number
- IN2013MU03472A IN2013MU03472A IN3472MU2013A IN2013MU03472A IN 2013MU03472 A IN2013MU03472 A IN 2013MU03472A IN 3472MU2013 A IN3472MU2013 A IN 3472MU2013A IN 2013MU03472 A IN2013MU03472 A IN 2013MU03472A
- Authority
- IN
- India
- Prior art keywords
- file
- indexing
- segments
- index
- nodes
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/182—Distributed file systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
ABSTRACT INDEXING OF FILE IN A HADOOP CLUSTER A file indexing system (102) for indexing a file to be stored onto a distributed file system (104) includes a segmentation module (122) to segment the file into a plurality of segments. The file indexing system (102) further includes an index generation module (124) to initiate indexing of the file through a plurality of nodes of a Hadoop cluster, where each of the plurality of nodes indexes one or more segments from amongst the plurality of segments to generate at least one index corresponding to the one or more segments. The file indexing system (102) further includes an index transfer module (126) to store the at least one index onto the distributed file system (104). <To be published with Figure 1>
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IN3472MU2013 IN2013MU03472A (en) | 2013-10-31 | 2013-10-31 | |
US14/498,598 US9846702B2 (en) | 2013-10-31 | 2014-09-26 | Indexing of file in a hadoop cluster |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IN3472MU2013 IN2013MU03472A (en) | 2013-10-31 | 2013-10-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
IN2013MU03472A true IN2013MU03472A (en) | 2015-07-24 |
Family
ID=52996626
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
IN3472MU2013 IN2013MU03472A (en) | 2013-10-31 | 2013-10-31 |
Country Status (2)
Country | Link |
---|---|
US (1) | US9846702B2 (en) |
IN (1) | IN2013MU03472A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106294721A (en) * | 2016-08-08 | 2017-01-04 | 无锡天脉聚源传媒科技有限公司 | A kind of company-data statistics and deriving method and device |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104834730B (en) * | 2015-05-15 | 2018-06-01 | 北京京东尚科信息技术有限公司 | data analysis system and method |
US9961068B2 (en) | 2015-07-21 | 2018-05-01 | Bank Of America Corporation | Single sign-on for interconnected computer systems |
CN105354251B (en) * | 2015-10-19 | 2018-10-30 | 国家电网公司 | Electric power cloud data management indexing means based on Hadoop in electric system |
CN105868253A (en) * | 2015-12-23 | 2016-08-17 | 乐视网信息技术(北京)股份有限公司 | Data importing and query methods and apparatuses |
CN105740727A (en) * | 2016-02-02 | 2016-07-06 | 上海斐讯数据通信技术有限公司 | Distributed storage method and system of private data |
US20200126010A1 (en) * | 2016-06-15 | 2020-04-23 | Solix Technologies, Inc. | Enterprise Business Record Management System |
CN106294842A (en) * | 2016-08-19 | 2017-01-04 | 浪潮(北京)电子信息产业有限公司 | A kind of data interactive method, platform and distributed file system |
CN106487582A (en) * | 2016-09-21 | 2017-03-08 | 努比亚技术有限公司 | A kind of method and apparatus of deployment search server |
CN106776929A (en) * | 2016-11-30 | 2017-05-31 | 北京锐安科技有限公司 | A kind of method for information retrieval and device |
CN106649800A (en) * | 2016-12-29 | 2017-05-10 | 南威软件股份有限公司 | Solr-based Chinese search method |
CN106844700A (en) * | 2017-02-03 | 2017-06-13 | 山东浪潮商用系统有限公司 | It is a kind of to ask tax system based on Sorl |
CN107066595A (en) * | 2017-04-19 | 2017-08-18 | 济南浪潮高新科技投资发展有限公司 | A kind of many application searches method of servicing of big data and system |
CN107273515A (en) * | 2017-06-21 | 2017-10-20 | 国网内蒙古东部电力有限公司信息通信分公司 | Power grid data asset resource retrieval and display based on polymorphic data indexing technology |
US10936681B2 (en) * | 2017-08-03 | 2021-03-02 | International Business Machines Corporation | Generalized search engine for abstract data types with skimming and approximate retrieval |
US11194804B2 (en) | 2017-12-05 | 2021-12-07 | Walmart Apollo, Llc | System and method for an index search engine |
US11392544B2 (en) * | 2018-02-06 | 2022-07-19 | Samsung Electronics Co., Ltd. | System and method for leveraging key-value storage to efficiently store data and metadata in a distributed file system |
US11748495B2 (en) * | 2018-11-28 | 2023-09-05 | Jpmorgan Chase Bank, N.A. | Systems and methods for data usage monitoring in multi-tenancy enabled HADOOP clusters |
US11294938B2 (en) | 2019-01-03 | 2022-04-05 | International Business Machines Corporation | Generalized distributed framework for parallel search and retrieval of unstructured and structured patient data across zones with hierarchical ranking |
CN109766360A (en) * | 2019-01-09 | 2019-05-17 | 北京一览群智数据科技有限责任公司 | A kind of list screening method and device |
CN110297971B (en) * | 2019-05-30 | 2022-09-20 | 百度在线网络技术(北京)有限公司 | Personalized resource retrieval method, device, equipment and computer readable storage medium |
US20220277054A1 (en) * | 2021-02-26 | 2022-09-01 | State Farm Mutual Automobile Insurance Company | Data migration of search indexes across search-engine deployments |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008082441A1 (en) * | 2006-12-29 | 2008-07-10 | Prodea Systems, Inc. | Display inserts, overlays, and graphical user interfaces for multimedia systems |
US8082258B2 (en) * | 2009-02-10 | 2011-12-20 | Microsoft Corporation | Updating an inverted index in a real time fashion |
US20110196854A1 (en) * | 2010-02-05 | 2011-08-11 | Sarkar Zainul A | Providing a www access to a web page |
US20120030018A1 (en) * | 2010-07-28 | 2012-02-02 | Aol Inc. | Systems And Methods For Managing Electronic Content |
US8650159B1 (en) * | 2010-08-26 | 2014-02-11 | Symantec Corporation | Systems and methods for managing data in cloud storage using deduplication techniques |
US9092151B1 (en) * | 2010-09-17 | 2015-07-28 | Permabit Technology Corporation | Managing deduplication of stored data |
CN108664555A (en) * | 2011-06-14 | 2018-10-16 | 慧与发展有限责任合伙企业 | Deduplication in distributed file system |
US20150112996A1 (en) * | 2013-10-23 | 2015-04-23 | Microsoft Corporation | Pervasive search architecture |
-
2013
- 2013-10-31 IN IN3472MU2013 patent/IN2013MU03472A/en unknown
-
2014
- 2014-09-26 US US14/498,598 patent/US9846702B2/en active Active
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106294721A (en) * | 2016-08-08 | 2017-01-04 | 无锡天脉聚源传媒科技有限公司 | A kind of company-data statistics and deriving method and device |
Also Published As
Publication number | Publication date |
---|---|
US20150120695A1 (en) | 2015-04-30 |
US9846702B2 (en) | 2017-12-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
IN2013MU03472A (en) | ||
IN2015DN03160A (en) | ||
MX2015008570A (en) | Modifying structured search queries on online social networks. | |
IL252772B (en) | Generating card stacks with queries on online social networks | |
MX347812B (en) | Using inverse operators for queries on online social networks. | |
PH12016500957A1 (en) | Data management for connected devices | |
WO2014179145A3 (en) | Drive level encryption key management in a distributed storage system | |
MX353716B (en) | Structured search queries based on social-graph information. | |
NZ754204A (en) | Object tracking system optimization and tools | |
SA515360346B1 (en) | Method for operating an arrangement for storing thermal energy | |
MX369047B (en) | Systems and methods for mapping and routing based on clustering. | |
GB2525788A (en) | Data synchronization | |
IN2012DE01073A (en) | ||
GB2514275A (en) | Identifying and ranking solutions from multiple data sources | |
ES2722408T3 (en) | A wind power plant, and a method to increase the reactive power capacity of a wind power plant | |
IN2013MU03094A (en) | ||
MX361879B (en) | Thematic repositories for transaction management. | |
WO2015167427A3 (en) | Data distribution based on network information | |
MX356937B (en) | Contact aggregation in a social network. | |
ES2596662A1 (en) | Electrical distribution network (Machine-translation by Google Translate, not legally binding) | |
MX363282B (en) | Ambiguous structured search queries on online social networks. | |
CA2912019C (en) | Systems and methods for generating issue networks | |
MX346840B (en) | Vertical-based query optionalizing. | |
BR112016023520A2 (en) | temperature management in battery arrangements | |
WO2015069378A8 (en) | Hierarchical distribution of control information in a massively scalable network server |