CN104823169B - For the index configurations that can search for data in network - Google Patents
For the index configurations that can search for data in network Download PDFInfo
- Publication number
- CN104823169B CN104823169B CN201380053433.7A CN201380053433A CN104823169B CN 104823169 B CN104823169 B CN 104823169B CN 201380053433 A CN201380053433 A CN 201380053433A CN 104823169 B CN104823169 B CN 104823169B
- Authority
- CN
- China
- Prior art keywords
- data
- data field
- search
- size
- subregion
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/182—Distributed file systems
- G06F16/1824—Distributed file systems implemented using Network-attached Storage [NAS] architecture
- G06F16/1827—Management specifically adapted to NAS
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2272—Management thereof
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
To be indexed and storage the data that can search for can be uploaded to network service using the entity of computing device.The data may include multiple data fields, and each data field has one or more associations value.The network service can analyze the data field and its respective associated value, to determine the data field type of the data field and will to be allowed for the search option of the data field.It is at least partially based on the data field type and described search option, the network service can produce search index configurations/mode.It is at least partially based on search index configurations/mode of the generation, the network service can produce the search index for the data.In some embodiments, the data conversion can also be indexed mutually compatible format at described search by network service.
Description
Background technique
Computing device is usually used to through network such as Internet traffic.The network-based service provided by service provider
Become more universal.Computing device is frequently used to be connected to network-based service, and the network-based service can provide
Service will be can search for data such as storage by computing device use/retrieval or provide additional processing power to calculating dress
It sets.About the network-based storage that can search for data, the user of computing device usually require as its data option and installment and/or
Format, so that its data can be indexed and be stored by network-based service.Conventional method usually requires user and determines for its number
According to suitable configuration.Conventional method it may also be desirable to the format that the data of user have to comply with, and then require user by its data
It is converted into the format.This may be inconvenient, fiber crops for the user for wanting to be stored and searched for using network-based service
Tired or difficult, and then reduce overall customer experience.
Detailed description of the invention
Each embodiment according to the disclosure is described with reference to the accompanying drawings, in the accompanying drawings:
Fig. 1 shows the exemplary environments of the aspect using each embodiment;
Fig. 2 shows the exemplary system embodiments for the index configurations that can search for data in networked environment;
Fig. 3 is shown using the exemplary web page browsing ring for the index configurations that can search for data in networked environment
Border;
Fig. 4 shows the examplar search that can be generated according to each embodiment and indexes;
Fig. 5 shows the exemplary method embodiment for the index configurations that can search for data in networked environment;
Fig. 6 shows the exemplary means that can be used to realize the aspect of each embodiment;
Fig. 7 shows the example components of client terminal device described device as shown in Figure 6;And
Fig. 8 shows the environment that each embodiment can be achieved.
Specific embodiment
The system and method that description generates index configurations, the index configurations can be used to generate for by least one net
The search of the received data of network indexes.At least some embodiments allow computing device that will count by network (for example, internet)
It is stored in distribution provided by network service (that is, Internet Service Provider) according to uploading to.Network service can be analyzed and be uploaded
Data are with the type (that is, data field type) of the data field of each data field in the multiple data fields of determination.Network
Service can analyze uploaded data to determine whether one or more for each data field in multiple data fields
A search option is included in uploaded data.
At least some embodiments permission computing devices are upload the data to by network (for example, internet) to be taken by network
In storage distribution provided by business (that is, Internet Service Provider, network-based service etc.).One or more user/entities
(for example, using one or more computing devices) can search for uploaded data by network using search index, described to search
Rustling sound, which draws, to be provided by network service.
In some embodiments, the data uploaded may include multiple data fields.Network service can be analyzed and be uploaded
Data are with the type (that is, data field type) of the data field of each data field in the multiple data fields of determination.For example,
Each data field can have the type including integer, text type or literal type.
In addition, network service can analyze uploaded data to determine whether for every number in multiple data fields
It is included in uploaded data according to one or more search options of field.For example, network service can determine for each phase
Answer data field, if allow to include the option in the search index that will be generated by respective data field.Network service is also
It can determine for each respective data field, if allow to calculate the option of the face number of respective data field.In addition, network takes
Business can determine for each respective data field, if allows to will be responsive to search inquiry and returns/provide and respective data word
The option of the associated value of section.
In some embodiments, network service can produce for data field type determined by being at least partially based on and
The index configurations (that is, search index configurations, mode, index setting etc.) of the data for the search option that will allow.Network service
Index configurations can be at least partially based on and generate the search index for being used for data.
It can such as be provided according to each embodiment, be described below and propose various other functions and advantage.
Fig. 1 shows the exemplary environments 100 using the aspect in each embodiment.Exemplary environments 100 may include
At least one computing device 102, network 104 (for example, internet, Intranet, local network, local area network etc.) and network service
106 (that is, Internet Service Provider, network-based services etc.).At least one computing device 102 can be led to by network 104
Letter it is connected to network service 106.In some embodiments, computing device 102 can be in the feelings of no network 104 such as internet
Network service 106 is transmitted under condition.As shown in fig. 1, user 108 or other realities of at least one computing device 102 also may be present
Body (for example, individual, company, tissue, group etc.) 108.User or entity 108 can by network 104 by data 110 from least one
A computing device 102 is transmitted to network and services 106 (and vice versa).
In some embodiments, network service 106 may include and/or utilize the one or more for being connected to network 104
Host or server.For example, memory space can be rented user or another reality to client, such as device 102 by network service 106
Body (for example, company, tissue, group, individual etc.) 108.Therefore, network 104 can be used in user/entity 108 of computing device 102
It will be in the data of device 102 storage to network service 106.In other words, user/entity 108 and/or device 102 can be through
It is stored by network service 106 using network-based calculating.
In an example, computing device 102 can be transmitted by network 104 and will be stored in network services 106
Data 110, as shown in fig. 1.Data 110 can be any data for network-based calculating, such as be used to search for, data
The data of library storage, operation application program, operation virtual machine, operation operating system etc..Computing device 102, which can transmit, to be stored
Data 110 in the storage distribution provided by service 106.For example, user/entity 108 is commercially available or rental service 106 on
Memory space, and store distribution and can be assigned and be assigned to user/entity 108.In some embodiments, user/entity
108 can have particular account and/or storage distribution in service 106;The memory space of entity 108 is assigned to (for example, storage
Distribution) it can be associated with the account of entity 108.
Entity 108 may also hope to network service 106 and provide the search index for being used for data 110.Conventional method usually needs
Want entity 108 first to provide data 110 to be indexed to configuration (that is, index configurations, mode, index setting etc.), or normal
Configuration/format (for example, search data format (SDF)) that rule method may need solid data 110 to have to comply with, therefore it is required that
Its data 110 is converted into required configuration by entity 110.However, this is inconvenient, troublesome or difficult for entity 108
's.
In some embodiments, data 110 can be transferred to network service 106, and network service 106 by entity 108
The index of (for example, being not necessarily to instructions or requests from entity 108) analysis data 110 and generation for data 110 it can match automatically
Set (for example, search index configurations, search indexing model etc.).For example, in some embodiments, network service 106 can pass through
Determination includes the data field type 112 of one or more data fields in data 110 and the use that determination will allow
Data 110 are analyzed in including the search option 114 of one or more data fields in data 110.
About the Class1 12 for determining data field, may be present can be associated with data 110 (for example, document, file etc.)
A variety of data field types, such as data field, the data field of literal type or the data field of text type of integer.
In some embodiments, data 110 may include multiple data fields, and each data field includes value (for example, data field
" title " can have the value of " ABCD- brand shirt ";Data field " price " can have " value etc. of $ 20 ').Network service 106
It can analyze including multiple data fields in data 110, to determine the number of each data field in multiple data fields
According to the type of field.
For example, for each data field, network service 106 can determine whether the value of each respective data field wraps
Include the integer amount more than specified integer amount threshold value (for example, the value of data field " price " is integer entirely);If it is this
Sample, then can determine that respective data field is integer data field type.Network service 106 can also be for example, by determination
It is below at least one determine whether data field is lteral data field type: associated with data field value has super
Cross specified text numerical lower limits value but lower than the alphabetic character quantity of specified text the upper limit of the number value;It is related to data field
The number of the different value of connection is lower than specified text different number threshold value;The percentage of different value is lower than specified text difference hundred
Divide and compares threshold value;Or the length of value is lower than specified word length threshold value.In some embodiments, network service 106 can be such as
The frequency and/or percentage for considering the different value in the length and data field value of data field value are data field to be identified as
Text type;If in data field value there are many different values and data field value it is very long (for example, having more than threshold value
Alphabetic character number), then data field is likely to text type.In some embodiments, if data field is not
Integer type or literal type, then data field can be text type.
About search option 114 is determined, network service 106 can determine that will be allowed is used for 110 (data word of data
Section) one or more search options 114.For example, having determined that the data word including the data field in data 110
When segment type, network service 106 can be determined whether to allow by data field to include the option in the search index that will be generated,
Whether allow to calculate the option of the face number of data field, and/or whether allows return/offer for the search value of data type
Option.
For example, if the data field type of data field is confirmed as text (for example, data field is " product description "
And value is long paragraph), then network service 106 may be selected not include in the option searched in index by data field (and value).
In another example, for the data field with integer data field type (for example, data field is " production year " and value
It is the time) for, network service 106 may be selected to allow by data field to include the option in the search index that will be generated,
And it services 106 to allow to calculate the option of the face number of data field.Face number, which can be, is in data word to how many search result
The counting of a certain classification of section.For example, network service 106 can determine that it is necessary to mention if data field is " production year "
For face number, face number instruction how many search result are associated with a certain classification;For example, " 1984 (23), 2002 (12), 2010
(18) " example for showing the face number about " production year " data field, wherein 23 search results are associated with " 1984 ", 12
Search result is associated with " 2002 " and 18 search results are associated with " 2010 ".
In some embodiments, network service 106 can also decide the value for allowing returned data field.For example, in response to
Searching request, not all data field (and value) needs that can search for are returned (for example, retrieval and presentation).Network service 106
It can decide whether the value of returned data field.
The generation of the configuration for data 110 is turned now to, network service can be automatic (for example, being not necessarily to the finger from entity 108
Enable) generate the configuration (for example, search index configurations, mode etc.) for being used for data 110.In some embodiments, the configuration
Can at least partly assist in how index data 110;The index configurations can at least partly manage incite somebody to action how index data
110.The configuration or mode may specify the data field type including each data field in data 110, and instruction is each
Whether data field can search for, and indicate whether each data field is graduate (for example, classifiable) and can have
Help establish other similar informations of index.After generating the configuration for the data 110 that will be indexed, network service 106
It can produce the search index of the data 110 for configuring produced by being at least partially based on.
Fig. 2 shows using the example web page browsing environment for the index configurations that can search for data in networked environment
200.Example web page browsing environment 200 may include the example web page 202 reproduced by application program such as web browser.At this
In a example, webpage 202 can be provided by network service associated with domain ABCD.com.
For example, user/entity (for example, client of network service) can be retailer and can upload and sell shirt phase
The data of pass.Data can be indexed by network service and store and make it to other people as user/entity potential customers can
Search.Network service can analyze the type that the data include the data field of each data field in data with determination
(that is, data field type).For example, data relevant to the sale of shirt may include data field, such as " color " 206, " greatly
It is small " 208, " price " 210, " description " and other fields.Network service can analyze the value of each data field with each phase of determination
Answer the type of data field.Network service may further determine that the one or more options for being allowed for each data field (for example, searching
Rope option).Network service then can generate configuration/mode of the data for that will index.Then, network service can be based on institute
It states configuration/mode and generates the index for being used for data.
For example, network service can recognize data field " color " and determine the value of the data field (for example, " red
Color ", " blue ", " white ", " green " etc.) it is alphabetical/text, and " color " data field can be identified as text class
Type.(in this example, can be uploaded by entity associated with " color " data field data and be worth (for example, " red ",
" blue ", " white ", " green " etc.)).In another example, network service can recognize at least part of uploaded data
In " size " data field, and it is numerical value that determination, which includes value in " size " data field,.In this example, network
Service can determine that " size " data field is integer type.In another example, network service can recognize uploaded data extremely
The value of " description " data field in few a part, and can determine that described value includes both number and alphabetic character and/or institute
Stating value is that very long and/or described value has different term/phrase/symbols for number of characters.In this example, net
Network service can determine that " description " data field is text type.
About search option, network service be can determine for each of data field, if be allowed corresponding data
Field includes the option in the search index that will be generated.For example, in some embodiments, can be omitted from search index
" description " data field (and respective value).If it is, then the inquiry will when the inquiry that operation is indexed about search
Do not search for " description " data field.However, " description " data field and value can be included in search index by some embodiments.
In addition, network service can be determined whether the option for the face number for allowing to calculate each data field.As mentioned above, face
Number indicates that the result of how many matching search inquiries has the particular value (or range of value) for specific data field.For example, such as
Shown in Fig. 2, " color " data field with " red " value has face number 23 (that is, for 23 search of " red " shirt
As a result), however " blue " value of " color " data field has face number 28 (that is, 28 search results for being directed to " blue " shirt)
Etc..In some embodiments, described value can be overlapped (i.e., it is not necessary to for accurate matching).For example, having blue and red bar
The shirt of line can be associated with " blue " and " red " value and/or has other values.In some embodiments, network service can
It is determined that the face number of some data fields is calculated, but it is not necessary to calculating the face number of all data fields.For example, network service
It can determine the face number that should have " color ", " size " and " price ", the face number of " description " may be not present.
In addition, network service can be determined whether the value for allowing returned data field.For example, can have including in data
Data field " internal product identification number ", the value of the data field be the product ID inside entity and be not intended to by
Show the client of entity;Similarly, network service can determine the value for not allowing to return such data field.
It is contemplated that the additional option and number relevant to other projects that those skilled in the art will appreciate that may be present
According to.For example, network service can be determined whether the option for allowing that data field can be made to classify (for example, classifiable).With reference to figure
2, in some embodiments, " price " data field can be classified/be classified by its value (for example, from lowest price to most
High price, from ceiling price to lowest price etc.), " color " data field can alphabet sequence classification (being not shown in Fig. 2) etc.
Deng.In another example (not shown), data relevant to media file such as music, video, book, photo etc. may be present.Matchmaker
The example data field of body file may include but be not limited to: " title ", " artist/author ", " creation time ", " price ",
" grade " etc..
The type including the data field in data and one for including data field in data has been determined
A or multiple search options, network service can produce the configuration (that is, search index configurations, mode etc.) for data, described to match
The generation set is at least partially based on identified data field type and search option.
After generating configuration, network service can be at least partially based on generated configuration and generate the search rope for being used for data
Draw.Therefore, network service can be utilized by the data that entity provides and deposited by the search index for data that network service generates
Storage.
Fig. 3 shows the exemplary system embodiment 300 for the index configurations that can search for data in networked environment.Show
Example property system implementation plan 300 may include system controller 302, at least one communication transceiver 304, data field type analysis
Device 306, search option analyzer 308, index configurations generator 310, index generator 312 and at least one storage distribution 314.
System controller 302 can promote system and execute for the various of the index configurations that can search for data in networked environment
Operation.System controller 302 can be communicated at least one communication transceiver 304, to promote to one outside system 300 or
In the data transmission in multiple sources and/or the data receiver and promotion system in one or more sources outside system 300
Data communication.
System 300 can be by data field type analyzer via communication transceiver 304 (for example, from entity) received data
306 analyses, to determine and include the associated type of each of data field in data.Data can also be by searching for
Option analyzer 308 is analyzed, to determine whether one or more search options about the data being included in data
Each of field.Search option determined by data field type determined by being at least partially based on and one or more,
Index configurations generator 310 can produce search index configurations/mode.Then, search index caused by being at least partially based on is matched
/ mode is set, index generator 312 can produce the search index for data.It can be searched for by data and for caused by data
Index is stored in one or more storage distribution 314.
It is contemplated that all parts of exemplary system 300 and/or part can be implemented as hardware, software or both
Combination.For example, the various pieces of system 300 can via circuit, processor, application program, a part of procedure code, algorithm or its
Any combination etc. is realized.It is further contemplated that Fig. 3 is example and is intended only for illustrative purpose.For example, all parts are not
It must be configured according to Fig. 3.In some embodiments, all parts need not couple closely to each other, and alternatively spread
In more dispersed system.For example, component such as index generator can reside in individually/heterogeneous networks and/or system, but still
Remain to the communication connection of other components.
Fig. 4 is shown can be according to the examplar search index 400 that each embodiment of the disclosure generates.With reference to Fig. 4, searching
Root node 402 may be present in drawing in rustling sound.In the example in figure 4, data can be uploaded by entity such as T-shirt retailer.Data can be right
The information for the T-shirt (root node 402) that can be used for selling that Ying Yu is manufactured about entity.It is relevant to T-shirt that expression may be present
The father node (for example, 404,406,408) of the data field of data.For example, T-shirt can have color data field 404, size
Data field 406 and price data field 408.
With continued reference to the example of Fig. 4, data field can have the child node (example for indicating the value in each respective data field
Such as, 410,412,414,416,418).For example, can at least exist two kinds of colors (red 410 and blue 412), a kind of size (in
And two kinds of Price Range (< $ 10416 and $ 10- $ 20418) number 414).Last set result/project also may be present (for example, T-shirt
Shirt 420,422,424,426,428,430), described search result/project can correspond to one or more in data field and value
It is a.
In this example, all three data fields (color 404, size 406 and price 408) are intended to be included in search
In index, there can be face number and may be in response to relevant search inquiry and/return value is provided.For example, as shown in Figure 4, face
Color: red 410 can have face number three, and color: blue 412 can have face number two.Size: medium size 414 can have face number two.
Price: < $ 10416 can have face number one, and price: $ 10- $ 20418 can have face number two.In addition, the search inquiry of color:
For example, red 410 will return to T-shirt 422,424 and 428;For example, search red 410 and blue 412, will return to T-shirt 422
Etc..Although examplar search index 400 is shown as tree construction, however, it is contemplated that, it can be in a number of alternative manners and/or sharp
Search index is generated with other structures.
Fig. 5 shows the exemplary method embodiment 500 for the index configurations that can search for data in networked environment.Again
It is secondary, it should be understood that in the range of each embodiment, there may be executed or be performed in parallel by similar or substitution order
Other step, less step or alternative steps, unless otherwise stated.At step 502, exemplary method embodiment 500
It can receive the data that will be indexed.For example, the data that will be indexed that method 500 can be uploaded by entity, and the data can
Including multiple data fields (or at least one data field).In some embodiments, illustrative methods may further determine that and number
According to the title of associated data field.At step 504, illustrative methods 500 can determine data word associated with data
The type of section.For example, the method can determine multiple field classes associated with each data field in multiple data fields
The field type of type.Multiple field types may include in (but being not limited to) integer type, literal type or text type at least
It is a kind of.The type of data field can be determined from the type of multiple data fields.It in some embodiments, can be based on label, letter
Number or it is other instruction to identify multiple data fields and its type and/or title.At step 506, method 500 can determine will
The one or more search options about data field associated with data allowed.For example, one or more of search
Option may include at least one of the following: include the option in the search index that will be generated by respective data field;Meter
It calculates the option of the face number of respective data field or the option of one or more values associated with respective data field is provided.Step
508 may include that the type for being at least partially based on data field and one or more search options are matched come the index for generating for data
It sets.Then, at step 510, method 500 can be at least partially based on the search for generating for the index configurations of data and being used for data
Index.In some embodiments, it can be produced based on whether data are the combinations of structural data, free circumferential edge or both
Raw search index.In some embodiments, data, index configurations or can be by one or more can also be provided in illustrative methods
At least one of the index of search inquiry search.
It may be present including the various other information in index configurations.For example, configuration can be reserved for about data field whether
Can facet (i.e., if the face number of data field should be calculated), whether data field can classify (that is, with data field
Whether search result should be classified) etc. information.
In some embodiments, network service can by reception/upload data conversion in the first format at the second format,
Second format is mutually compatible with search index and the data for being converted into the second format can be stored in one or more and deposits
In storage distribution.For example, network service can receive data from entity, the data can have any one in several various formats
Kind is a variety of, such as .PDF .DOC .DOCX .CSV .JSON .XML.Data can be automatically converted by network service can be with net
The format of road service mutually compatible (for example, can be by ... identification, can be by ... use), such as searches for data format (SDF).
In some embodiments, network service can be based on following operation change data: by the first format and the second format
Compare, and modify associated with the first format at least one data field with corresponding to it is associated with the second format extremely
Few one data word section.For example, network service may compare from entity received data format and modify/upload the lattice
Formula, so that it is mutually compatible with network service.This may include identifying whether to add, remove or be changed to one of the format
Or multiple data fields.
In some embodiments, it is more than specified that network service can be had based on value associated with data field is determined
The integer character amount of integer amount threshold value is integer type come the type for determining data field.In addition, network service can be by true
It is fixed it is below at least one come the type for determining data field be literal type: it is more than finger that value associated with data field, which has,
Fixed text numerical lower limits value but lower than the alphabetic character quantity of specified text the upper limit of the number value;It is associated with data field
The number of different value is lower than specified text different number threshold value;The percentage of different value is lower than specified text different weight percentage
Threshold value;Or the length of value is lower than specified word length threshold value.In addition, network service can determine data based on following person is determined
The type of field is text type: value associated with data field has integer and word more than specified amount of text threshold value
At least one of alphabetic character quantity;The number of kinds of characters is more than specified text different number threshold value;The hundred of kinds of characters
Divide than being more than specified text different weight percentage threshold value;Or the length of character is more than specified text size threshold value.
In some embodiments, network service can determine to allow by data field to include indexing in the search that will be generated
In option, it is described decision be at least partially based on reception include the signal in data field, the signal designation data field
It will be included in search index.Network service can also decide the option for allowing to calculate the face number of data field, and the decision is extremely
It is at least partly based on and determines that the quantity of at least one value associated with data field is more than specified face number lower limit value and is lower than
Specified face number upper limit value.Network service can further determine to allow to provide in response to related search queries and data field phase
Associated value, it includes the signal in data field that the decision, which is at least partially based on reception, the signal designation will provide with
The associated value of data field.
In some embodiments, one or more search inquiries (for example, term in search inquiry) can be taken by network
Business utilizes.For example, network service can be inferred to the searcher just facet in specific data field from search inquiry.Thus, for example,
Network service can determine that data field should be literal type.
In some embodiments, when searcher input inquiry term and request search when, can be by correlation with spy
Determine ranking expression (for example, order of result) and one or more search results are presented.The disclosure allow creation consider it is other because
Element such as inquires the more complicated list of irrelevant factor (for example, may be present including the popularity data field in data)
It reaches.The disclosure may also allow for by check data and determine it is meaningful for data field popularity analyze, so as to
It is proposed workable ranking expression.For example, body of text data field type and its length may be present (for example, or its length
Inverse) can be considered and can for ranking express useful information is provided.
In some embodiments, data field type may also include geographic location type, time type, data type or
Float.
With the consistent each embodiment of the disclosure also using sample data.For example, mention can be first by sample for user/necrosis
Notebook data is provided to network service.Network service can analyze the sample data to determine the type and search choosing of data field
?.Based on data field type and for the search option of sample data, network service can produce index configurations, and subsequent base
Search index is generated in generated index configurations.
Fig. 6 shows the exemplary electronic user apparatus 600 that can be used according to each embodiment.Although showing portable
Computing device (such as E-book reader or tablet computer), it is to be understood that can be according to discussed herein each
A embodiment is using any electronic device that can receive, determine and/or handle input, and wherein described device may include (example
As) desktop computer, notebook computer, personal digital assistant, smart phone, video game console, TV set-top box and
Portable media player.In some embodiments, computing device 600 can be simulator, and operation amplifier such as can be used
The device of device execution signal processing.In this example, computing device 600 has display screen 602, the display screen on front side
It will believe under normal operation to user's (for example, on the computing device on display screen same side) display in face of display screen
Breath.In this example, computing device include at least one video camera 604 or at least one video camera at least one
Static or video image information other image-forming components are captured on the visual field.In some embodiments, computing device may be wrapped only
Containing an image-forming component, and in other embodiments, computing device may include several image-forming components.Each image capture
Element for example can be video camera, charge coupled device (CCD), mobile detection sensor or infrared sensor and many other
Possibility.If there is multiple images capture element on the computing device, described image capture element can be inhomogeneity
Type.In some embodiments, at least one image-forming component may include at least one wide angle optical element such as fish-eye lens, institute
Stating wide angle optical element allows video camera to capture image in wide range such as 180 degree or greater angle.In addition, each image
Capture element may include being configured in extremely rapid succession capture the digital still video camera of subsequent frame, or can capture stream video
Video cameras.
Exemplary computing devices 600 further include at least one microphone 606 or can capture audio data (such as device users
If saying or order) other audio capturing devices.In this example, microphone 606 is placed on device and display
Shield on 602 identical sides, so that the microphone usually can be if preferably acquisition equipment user says.At least some
In embodiment, microphone can be shotgun microphone, and the shotgun microphone is essentially directly captured from the front of microphone
Acoustic information, and the sound of limited quantity is only picked up from other directions.It should be understood that microphone is in different embodiments
In can be positioned on the edge of any suitable surface in any region, face or device, and the multiple microphone can be used for
Audio recording and filtering purpose etc..
Exemplary computing devices 600 further include at least one orientation sensor 608, such as position and/or mobile determining element.
Sensors with auxiliary electrode may include for example being operable to the orientation of detection computing device and/or changing in orientation of and the small movement of device
Accelerometer or gyroscope.Orientation sensor may also include electronics or digital compass, and the electronics or digital compass can indicate to fill
Set the direction (such as north or south) for being determined pointed (such as relative to main shaft or other such directions).Orientation sensor may be used also
Including or including global positioning system (GPS) or it is operable to determine the relative coordinate of the position of computing device and the phase of device
To the similar setting element of the information moved greatly.Each embodiment may include that any of this one or more class component is suitble to
Combination.As should be appreciated that, algorithm or mechanism for determining relative position, orientation and/or movement can at least partly be taken
Certainly in the selection for the element that can be used for device.
Fig. 7 shows one group of universal component of exemplary computing devices 700 (as relative to device 600 described in Fig. 6)
Logic arrangement.In this example, device includes the place for executing the instruction being storable in memory device or element 704
Manage device 702.As the apparent for skilled in the art, device may include the memories of many types, data storage or
Nonvolatile computer readable storage medium, the first data storage of the program instruction such as executed for processor 702, for scheming
The SAM Stand Alone Memory of picture or data, for the removable memory etc. with other device shared informations.Described device usually will packet
Some type of display element 706 is included, such as touch screen or liquid crystal display (LCD), but such as the device of portable media player
Information may be transmitted via other mechanisms (such as passing through audio tweeter).As discussed, the device in many embodiments
To include can be at least one image capture element 708 that other objects near projected image or device are imaged, such as
Video camera or infrared sensor.The method of capture image or video is carried out using computing device, using video camera element at this
It is also known in field and will not be discussed in detail herein.It will be appreciated that single image, multiple images, week can be used
The imaging of phase property, consecutive image capture, image stream etc. execute image capture.In addition, device may include starting and/or stopping image
The ability of capture, such as when receiving order from user, application program or other devices.Exemplary means, which are similarly included, to be operated
To capture at least one audio capturing component 712 of audio-frequency information, such as monophonic or stereo from least one Main way
Microphone array.Microphone can be unidirectional or omnidirectional microphone known to such device.
In some embodiments, the computing device 700 of Fig. 7 may include one or more communication device (not shown), such as
Wi-Fi, bluetooth, RF, wired or wireless communication system.Device in many embodiments can with network such as Internet traffic, and
And it can be communicated with other such devices.In some embodiments, device may include can be received from user it is conventional defeated
At least one the additional input device entered.This conventional input is for example including button, touch tablet, touch screen, steering wheel, manipulation
Bar, keyboard, mouse, keypad or user can be by any other such devices or element being used to device input order.
However, in some embodiments, this device may not include any button, thereby increases and it is possible to only can be via vision and audio
The combination of order controls so that user can without being contacted with device control device.
Device 700 may also include at least one orientation or motion sensor 710.As discussed, sensors with auxiliary electrode can wrap
The accelerometer or gyroscope or electronics or digital compass for being operable to detection orientation and/or changing in orientation of are included, it is described to indicate
Device be determined towards direction.Mechanism can also (or alternatively) include or including global positioning system (GPS) or can operate
Come determine computing device position relative coordinate and device relatively large movement information similar setting element.Device
It may include other elements, such as can allow position finding by triangulation or another such method.These mechanism can be with processor
702 communications, and then any of many action described herein or proposed can be performed in device.
As example, device of the computing device as described in relative to Fig. 6 can capture and/or track whithin a period of time user
Various information.This information may include any suitable information, such as position, action (for example, send message or creation file),
User behavior is (for example, how long user executes time quantum, the user's browser interface of a subtask, user effort in task
Mode etc.), customer parameter (for example, user likes how receiving information), the request opening application program, being submitted, connect
The calling etc. of receipts.As explained above, the information can by link information or other associations it is such in a manner of store, in turn
Any suitable dimension or dimension group access information can be used in user.
As discussed, distinct methods can be realized in various environment according to the embodiment.For example, Fig. 8 is shown
For realizing according to the embodiment of the environment 800 of the various aspects of each embodiment.As will be appreciated, although making for explanatory purposes
With network-based environment, but each embodiment can be optionally realized using varying environment.System includes E-customer
End device 802, the electronic client device may include being operable to send and receive request, message on suitable network 804
Or information and any appropriate means that information is sent back to device users.The example of such client terminal device includes individual calculus
Machine, mobile phone, hand-held messaging device, laptop computer, set-top box, personal digital assistant, E-book reader etc..Net
Network may include any suitable network comprising intranet, internet, Cellular Networks, local area network or any other such network or on
State the combination of network.Component used in such system can be at least partially dependent on the type of selected network and/or environment.For
It is well-known via the agreement and component of such network communication, thus is no longer discussed in detail herein.Communication on network can
To be realized via wired or wireless connection and combinations thereof.In this embodiment, network includes internet, because environment includes
It, can for receiving request and the in response to the request Web server 806 of service content, however for other networks
Using the replacement device for servicing similar purpose, as the skilled person will be apparent.
Shown environment includes at least one apps server 808 and data storage 810.It should be appreciated that can
With in the presence of can be chained up or in other ways come configure several apps servers, layer or other elements, process or
Component, these apps servers, layer or other elements, process or component can interact to execute and such as store from suitable data
The task of device acquisition data.As it is used herein, term " data storage ", which refers to, can store, access and acquisition data
Any device or device combination, described device may include data server, database, data storage device and data storage medium
Any combination and any number.Apps server may include any suitable hardware and software, the hardware and software view
It executes the needs of the aspect of the one or more application program of client terminal device and data storage integrates and handle applications
Most of data access and service logic.Apps server provides the access control service to cooperate with data storage,
And such as text that will be transmitted to user, picture, audio and/or video content can be generated, in this embodiment institute
Stating content can be provided services to the user in the form of the suitable structured language of HTML, XML or another by Web server.It is all
Content delivery between request and the disposition and client terminal device 802 and apps server 808 of response can be taken by network
Device 806 be engaged in dispose.It should be appreciated that network server and apps server are not necessary, and only exemplary group
Part, because structured code discussed herein can be on any suitable device or host as noted elsewhere herein
It executes.
Data storage 810 may include several independent tables of data, database or other data storage mechanisms and medium, use
To store data relevant to particular aspects.For example, shown data storage includes generating data 812 for storing
With the mechanism of user information 816, the mechanism can be used for the content at service creation end.Data storage is also shown as including being used for
The mechanism of storage record or session data 814.It should be understood that, it is understood that there may be many being stored in data storage may be needed
Other aspects, such as page image information and access right information, the aspect can be optionally stored in mechanism listed above
In any mechanism or in the additional mechanism that is stored in data storage 810.Data storage 810 can be by associated with it
Logic operates, and instructs to receive from apps server 808, and obtains data in response to described instruction, updates
Data handle data in other ways.In an example, user can submit searching request for certain type of element.
In this situation, data storage may access user information to verify the identity of user, and may have access to catalog detail information
To obtain the information of the element in relation to the type.Then information can such as be returned to use in the form of the results list on webpage
Family, user can check the list via the browser on user apparatus 802.It can be in the private pages or window of browser
In check the information of interested particular element.
Each server is typically included operating system, and the operating system provides the general management for being used for the server
With the executable program instructions of operation, and each server is typically included the computer-readable medium of store instruction, the finger
Enabling can make server execute its expectation function in the processor execution by server.Operating system be suitable for realizing mode and
The general utility functions of server are well-known or commercially available, and are easy to be realized by those of ordinary skill in the art, especially
It is to be realized according to disclosure herein.
In one embodiment, environment is distributed computing environment, and the environment utilizes via communication link, uses one
A or multiple computer networks are directly connected to several computer systems and component to interconnect.However, ordinary skill
Personnel should be understood that this system can be same smooth in the system with component more less or more than component illustrated in fig. 8
Ground operation.Therefore, the description of the system 800 in Fig. 8 substantially should be regarded as illustrative, and not limit the scope of the present disclosure.
As discussed above, each embodiment can be implemented in broad range of operating environment, the environment is one
May include under a little situations one or more subscriber computers, computing device or can be used for operating in multiple application programs any one
The processing unit of a application program.User or client terminal device may include any universal personal in multiple general purpose personal computers
Computer such as the desktop computer or notebook computer of operation standard operating system, and runs mobile software and can
Support cellular devices, wireless device and the hand-held device of multiple network connections and message-sending protocol.This system may also include multiple
Work station, the work station run various commercially available operating systems and for a specific purpose (as develop and data base administration)
Any application program in other known application program.These devices may also include other electronic devices, such as virtual terminal, slim
Client, game system and can be via other devices of network communication.
The also implementable part at least one service or Web service of various aspects, as that can be service orientation type frame structure
Part.If transmitting for any suitable type can be used to communicate for the service of Web service, such as by using being in extensible markup language
(XML) message of format, and exchanged using suitable agreements such as such as SOAP (originating from " Simple Object Access Protocol ").This kind of clothes
The process that business is provided or executed can be write with any suitable language, such as web service description language (WSDL).Use the languages such as such as WSDL
Client code automatically generates functionality in such as each SOAP framework of speech permission.
Most of embodiments are the various markets of known use to those skilled in the art using at least one
The network of either one or two of upper commercially available agreement for supporting communication, the agreement such as TCP/IP, OSI, FTP, UPnP, NFS,
CIFS and AppleTalk.For example, network can be local area network, wide area network, Virtual Private Network, internet, intranet, external connection
Any combination of net, public switch telephone network, infrared network, wireless network and above-mentioned network.
In the embodiment using Web server, Web server can run various servers or mid-tier application
In any application program, including HTTP server, ftp server, CGI server, data server, java server and industry
Business apps server.The server can also respond the request from user apparatus and execute program or script, such as logical
Cross execution one or more may be embodied as one or more with any programming language (such asC, C# or C++) or it is any
The web application of scripting language (such as Perl, Python or TCL) and combinations thereof the script write or program.The server is also
It may include database server, including but not limited to these are commercially available
With
Environment may include various data storages as discussed above and other memories and storage medium.These are situated between
Matter can reside in various positions, as one or more computers local (and/or residing in one or more computers)
Any one of computer on storage medium, or on separate network or all computers.In specific group embodiment, letter
Breath may reside in storage area network familiar to those skilled in the art (" SAN ").Similarly, can it is optionally local and/
Or long-range storage is for executing any necessary file for the function of belonging to computer.In the feelings that system includes the device of computer
Under condition, every kind of such device may include the hardware element that can be electrically coupled via bus, and the element includes, for example, at least
One central processing unit (CPU), at least one input unit (for example, mouse, keyboard, controller, touch screen or keypad) and
At least one output device (for example, display equipment, printer or loudspeaker).Such system may also include one or more and deposit
Storage device, such as disc driver, optical storage and solid-state storage device, such as random access memory (" RAM ") or read-only
Memory (" ROM ") and removable medium device, memory card, flash card etc..
Such device may also include computer-readable storage media reader, communication device (such as modem, network
Block (wirelessly or non-wirelessly), infrared communications set etc.) and working storage, as discussed above.Computer-readable storage medium
Matter reader can connect or be configured to receive computer readable storage medium with computer readable storage medium, to indicate remote
Journey, local, fixation and/or mobile storage means and for interim and/or more permanently contain, store, transmit and capture
The storage medium of computer-readable information.System and various devices generally also include multiple software applications, module, service or
Other elements at least one working storage device, including operating system and application program, such as client application journey
Sequence or Web browser.It is to be appreciated that alternate embodiment can have numerous variations compared to embodiment as described above.
For example, it is possible to use custom hardware and/or particular element may be implemented in hardware, software (including portable software, it is such as small
Program) or hardware and software in.Furthermore, it is possible to using the connection with such as other computing devices of network inputs/output device.
Storage medium and computer-readable medium containing code or partial code may include known in the art or used
Any suitable medium, including storage medium and communication media, such as (but not limited to) by storing and/or transmitting information (as based on
Calculation machine readable instruction, data structure, program module or other data) any method or technique in the volatibility implemented and non-
Volatibility, removable and irremovable medium, including RAM, ROM, EEPROM, flash memory or other memory technologies, CD-ROM, number
Word general optic disc (DVD) or other optical memory, magnetic holder, tape, magnetic disk storage or other magnetic storage devices, or it is available
In any other medium for storing wanted information and being accessed for system and device.Based on disclosure provided in this article and religious doctrine, originally
Technical field those of ordinary skill will be appreciated that the other way and/or method for realizing each embodiment.
Therefore, the description and the appended drawings should be understood in descriptive sense rather than in restrictive sense.However, it will be evident that
: in the case where not departing from the broader spirit and scope of the invention as illustrated in the claims, it can be done
Various modifications and change out.
The various embodiments of the disclosure can be described in view of following clause:
A1. a kind of computer implemented method for the index configurations that can search for data in networked environment comprising:
The data that will be indexed are received, the data include multiple data fields;
Determine title associated with each data field in the multiple data field;
Determine the field class in multiple field types associated with each data field in the multiple data field
Type, the multiple field type include at least one of integer type, literal type or text type;
Determine whether one or more search options for each of the data field, it is one or
Multiple search options include at least one of the following: including the choosing in the search index that will be generated by respective data field
?;Calculate the option of the face number of the respective data field;Or provide one or more associated with the respective data field
The option of a value;
It is at least partially based on the field type including each data field in the data and whether allows institute
The determination of one or more search options is stated to generate the search index configurations for the data;And
The described search index configurations for the data are at least partially based on to generate the search rope for the data
Draw.
A2. the computer implemented method as described in clause A1, wherein the data are the first formats, further include:
The data are converted into the second format from first format, second format indexes mutually simultaneous with described search
Hold;And
The data for being converted into second format are stored in one or more storage distribution.
A3. the computer implemented method as described in clause A2, wherein the data are converted into institute from first format
Stating the second format includes:
First format is compared with second format;And
Modification corresponds to related with second format at least one associated data field of first format
At least one data field of connection.
A4. the computer implemented method as described in clause A2, wherein second format is search data format (SDF).
A5. a kind of computer implemented method comprising:
The data that will be indexed are received,
The type for determining data field associated with the data, determines the data word from multiple data field types
The type of section;
One or more search options about the data field associated with the data that determination will allow;
The type and the one or more search option of the data field are at least partially based on to generate for described
The index configurations of data;And
The index configurations for the data are at least partially based on to generate the search index for the data.
A6. the computer implemented method as described in clause A5, wherein the data are the first formats, further include:
The data are converted into the second format from first format, second format indexes mutually simultaneous with described search
Hold;And
The data for being converted into second format are stored in one or more storage distribution.
A7. the computer implemented method as described in clause A6, wherein the data are converted into institute from first format
Stating the second format includes:
First format is compared with second format;And
Modification corresponds to related with second format at least one associated data label of first format
At least one data label of connection.
A8. the computer implemented method as described in clause A5, wherein the multiple data field type include integer type,
At least one of text type, literal type, geographic location type, time type, data type or float.
A9. the computer implemented method as described in clause A8, wherein determining that the type of the data field includes:
Determine that value associated with the data field has the integer character quantity for being higher than specified integer amount threshold value;
And
The type for determining the data field is the integer type.
A10. the computer implemented method as described in clause A8, wherein determining that the type of the data field includes:
Determine at least one of the following: value associated with the data field, which has, is higher than specified amount of text threshold
The alphanumeric character quantity of value;Different value number associated with the data field is higher than specified text different number threshold
Value;The high percentage of different value is in specified text different weight percentage threshold value;Or the length of value is higher than specified text size threshold
Value;And
The type for determining the data field is the text type.
A11. the computer implemented method as described in clause A8, wherein determining that the type of the data field includes:
Determine it is below at least one: value associated with the data field, which has, is higher than specified text numerical lower limits
Value but the alphanumeric character quantity for being lower than specified text the upper limit of the number value;Different value associated with the data field
Number is lower than specified text different number threshold value;The percentage of different value is lower than specified text different weight percentage threshold value;Or
The length of value is lower than specified word length threshold value;And
The type for determining the data field is the literal type.
A12. the computer implemented method as described in clause A5, wherein one or more of search options may include following
At least one of: it include the option in the described search index that will be generated by the data field;Calculate the data
The option of the face number of field;Or the option of value associated with the data field is provided in response to related search queries.
A13. the computer implemented method as described in clause A12, what wherein determination will allow one or more of searches
Rope option includes determining to allow to include the option in the described search index that will be generated by the data field, described
It determines to be at least partially based on the signal received include in the data field or determines that the type of the data field is text
At least one of type, data field described in the signal designation will be included in described search index.
A14. the computer implemented method as described in clause A12, what wherein determination will allow one or more of searches
Rope option include determine allow to calculate the data field face number the option, it is described decision be at least partially based on determine with
The quantity of multiple values of the associated distribution of data field is lower than specified face number upper limit value.
A15. the computer implemented method as described in clause A12, what wherein determination will allow one or more of searches
Rope option includes determining to allow to provide described value associated with the data field in response to the related search queries
The option, the decision are at least partially based on the signal received include in the data field or determination and the data word
Lower than specified at least one of return value length threshold, the signal designation will provide the length of the associated described value of section
Described value associated with the data field.
A16. the computer implemented method as described in clause A5, further include:
There is provided the data, the index configurations or by can be by the index that one or more search inquiries are searched for
At least one.
A17. the computer implemented method as described in clause A5, further include:
One or more Client-initiated inputs are at least partially based on to modify the index configurations.
A18. a kind of system comprising:
At least one communication transceiver;
One or more storage distribution;
At least one processor;And
Memory device comprising cause the system to perform the following operation when being executed by least one described processor
Instruction:
The data that will be indexed are received via at least one described communication transceiver;
The type for determining data field associated with the data, determines the data word from multiple data field types
The type of section;
One or more search options about the data field associated with the data that determination will allow;
The type and the one or more search option of the data field are at least partially based on to generate for described
The index configurations of data;And
The index configurations for the data are at least partially based on to generate the search index for the data.
A19. the system as described in clause A18, wherein the data are the first formats, and wherein described instruction leads to institute
State system further,
The data are converted into the second format from first format, second format indexes mutually simultaneous with described search
Hold;And
The data for being converted into second format are stored in one or more of storage distribution.
A20. the system as described in clause A19, wherein described instruction causes the system based on following operation come will be described
Data are converted into second format from first format: first format is compared with second format;And
It is associated at least with second format to correspond to modify at least one data field associated with first format
One data word section.
A21. a kind of nonvolatile computer readable storage medium comprising the instruction of element, described instruction exist for identification
The computing system is caused to perform the following operation when being executed by the processor of computing system:
The data that will be indexed are received,
The type for determining data field associated with the data, determines the data word from multiple data field types
The type of section;
One or more search options about the data field associated with the data that determination will allow;
The type and the one or more search option of the data field are at least partially based on to generate for described
The index configurations of data;And
The index configurations for the data are at least partially based on to generate the search index for the data.
A22. the nonvolatile computer readable storage medium as described in clause A21, wherein the multiple data field type
Including in integer type, text type, literal type, geographic location type, time type, data type or float
It is at least one.
A23. the nonvolatile computer readable storage medium as described in clause A22, wherein described instruction leads to the calculating
System is literal type based at least one of the following is determined come the type for determining the data field: with the data
The associated value of field, which has, is higher than specified text numerical lower limits value but lower than the alphabetical number of specified text the upper limit of the number value
Word character quantity;The number of different value associated with the data field is lower than specified text different number threshold value;It is different
The percentage of value is lower than specified text different weight percentage threshold value;Or the length of value is lower than specified word length threshold value.
A24. the nonvolatile computer readable storage medium as described in clause A21, wherein one or more of search are selected
Item includes at least one of the following: including the option in the described search index that will be generated by the data field;Meter
Calculate the option of the face number of the data field;Or value associated with the data field is provided in response to related search queries
Option.
A25. the nonvolatile computer readable storage medium as described in clause A24, wherein determination will allow described one
A or multiple search options include the option for determining to allow to calculate the face number of the data field, and the decision is at least partly
Referred to based on determining that the quantity of at least one value associated with the data field is higher than specified face number lower limit value and is lower than
Fixed face number upper limit value.
B1. a kind of computer implemented method for News Search subregion comprising:
Monitor stored data quantity or on the first subregion provided by network service in the rate of operation data
At least one, first subregion is included in the storage distribution provided by the network service;
Detect in the quantity or the rate it is described at least one be respectively more than specified amount threshold or specified
Rate-valve value;
The size for increasing by first subregion is executed in response to the detection or at least the second subregion is added to described deposit
At least one of storage distribution, at least one of the increase or the addition are at least partially based on the institute of stored data
State the rate of quantity or operation data;
It, will network flow associated with the storage distribution described in the increase or the addition during at least one
Amount is directed to the cache memory provided by the network service;And
When it is described increase or the addition described at least one execution complete when, the network flow is directed to institute
State storage distribution.
B2. the computer implemented method as described in clause B1, further include:
Search index of the monitoring for the storage distribution;
The size for detecting described search index is more than specified index size threshold value;And
Update for it is described storage distribution described search index with reflect about it is described storage distribution the increase or
In the addition it is described at least one.
B3. the computer implemented method as described in clause B1, wherein if the size of first subregion is lower than most
Big partition size threshold, then executing the increase to the size of first subregion, and wherein if described first point
The size in area is at the maximum partition size threshold, then executing the addition of at least described second subregion.
B4. a kind of computer implemented method comprising:
Monitor that the data in the storage distribution in networked environment use, the storage distribution has many subregions comprising
At least one subregion;
Determination includes whether the data at least one subregion described in described store in distribution use more than finger
Determine threshold value;
The size of at least one subregion described in modifying includes at least one stored in the number of partitions in distributing
It is a;
Will associated with storage distribution network flow guide far from the storage distribute in the size or institute
State the associated part of modification of at least one of number;And
When the modification is completed, the network flow is directed to associated with the modification in the storage distribution
The part.
B5. the computer implemented method as described in clause B4, further include:
The size for detecting the search index for the storage distribution is more than specified index size threshold value;And
It based on the size for modifying at least one subregion or include in the number of partitions in the storage distribution
It is described at least one come update for it is described storage distribution described search index.
B6. the computer implemented method as described in clause B5, wherein updating described search index includes rebuilding for described
The described search index of storage distribution is to reflect the size of at least one subregion described in the modification or be included in described deposit
Storage distribution in number of partitions in it is described at least one.
B7. the computer implemented method as described in clause B4, wherein it includes being stored in the storage that the data, which use,
Data bulk in distribution or at least one of the rate in the upper operation data of storage distribution.
B8. the computer implemented method as described in clause B7, wherein the specified threshold include specified amount threshold or
At least one of specified rate-valve value, and be more than wherein the specified quantity threshold when stored data quantity occurs
When the rate of value or operation data is more than at least one in the specified rate-valve value, the data use more than institute
State specified threshold.
B9. such as the computer implemented method of clause B8, wherein at least it is based partially on the letter used about historical data
Breath is to calculate the specified threshold.
B10. such as computer implemented method of clause B4, further include:
Determine that the network flow quantity for being directed into the storage distribution is higher than specified flow threshold;And
The storage distribution is modified based on the network flow quantity.
B11. such as computer implemented method of clause B10, wherein the network flow includes being stored in for searching for
The search inquiry flow of data in the storage distribution.
B12. such as computer implemented method of clause B10, wherein being modified based on the network flow quantity described
Storage distribution includes at least one of the following: modifying the size of at least one subregion;Modify the number of partitions;
It or at least one subregion replacement with different size include at least one subregion in the number of partitions.
B13. such as the computer implemented method of clause B12, wherein the different size includes different cpu powers, no
With RAM capacity, different at least one of hard drive space capacity or different bandwidth capacity.
B14. such as the computer implemented method of clause B4, wherein modify at least one subregion the size or
At least one of described number of partitions includes in the size or the number of partitions for increase at least one subregion
At least one, wherein executing if the size of at least one subregion is lower than maximum partition size threshold to institute
State the increase of the size of at least one subregion, and wherein if the size of at least one subregion it is described most
At big partition size threshold, then executing the increase to the number of partitions.
B15. such as the computer implemented method of clause B4, wherein modify at least one subregion the size or
At least one of described number of partitions includes in the size or the number of partitions for reduce at least one subregion
At least one, wherein executing the reduction to the number of partitions, and its if the number of partitions is greater than a subregion
In if the number of partitions is a subregion, execute the reduction to the size of at least one subregion.
B16. the computer implemented method as described in clause B4, further include:
Determine that the CPU of the storage distribution is used, wherein modifying at least one of the size or the number is base
At least one of used in the CPU for the determination that the data in the storage distribution use or the storage distributes.
B17. the computer implemented method as described in clause B4, further include:
It is described based on being modified with the data using at least one of associated configuration or Client-initiated input
Store the configuration of distribution.
B18. the computer implemented method as described in clause B4, further include:
Distribute obtainable resource based on the storage determine when to execute in the size or the number extremely
Few one modification.
B19. a kind of system comprising:
Storage distribution, the storage distribution have many subregions comprising at least one subregion;
At least one processor;And
Memory device comprising cause the system to perform the following operation when being executed by least one described processor
Instruction:
Monitor that the data in the storage distribution use;
Determination includes whether the data at least one subregion described in described store in distribution use more than finger
Determine threshold value;
The size of at least one subregion described in modifying includes at least one stored in the number of partitions in distributing
It is a;
Will associated with storage distribution network flow guide far from the storage distribute in the size or institute
State the associated part of modification of at least one of number;And
When the modification is completed, the network flow is directed to associated with the modification in the storage distribution
The part.
B20. the system as described in clause B19, further include:
At least one load balancer, be configured to facilitate in modifying the size or the number described in extremely
The network flow is directed away from the part of the storage distribution during one few, and facilitates to the size or institute
State the part that network flow when at least one the modification in number is completed is directed into the storage distribution.
B21. the system as described in clause B20, wherein at least one described load balancer is configured to guide the net
It includes many subregions in the storage distribution that network flow, which passes through,.
B22. the system as described in clause B19, further include:
At least one monitor module is configured to help to monitor that the data in the storage distribution use,
And aid in determining whether include in the storage distribution described in data at least one subregion using whether being more than
Specified threshold.
B23. a kind of nonvolatile computer readable storage medium of the instruction including element for identification, described instruction by
The processor of computing system causes the computing system to perform the following operation when executing:
Monitor that the data in the storage distribution in networked environment use, the storage distribution has many subregions comprising
At least one subregion;
Determination includes whether the data at least one subregion described in described store in distribution use more than finger
Determine threshold value;
The size of at least one subregion described in modifying includes at least one stored in the number of partitions in distributing
It is a;
Will associated with storage distribution network flow guide far from the storage distribute in the size or institute
State the associated part of modification of at least one of number;And
When the modification is completed, the network flow is directed to associated with the modification in the storage distribution
The part.
B24. the nonvolatile computer readable storage medium as described in clause B23, wherein described instruction leads to the calculating
System searches for the size indexed more than specified index size threshold value for the storage distribution further, detecting, and
It based on the size for modifying at least one subregion or include the institute in the number of partitions in the storage distribution
At least one is stated to update the described search index for the storage distribution.
B25. the nonvolatile computer readable storage medium as described in clause B24, wherein update described search index includes
Rebuild for it is described storage distribution described search index with reflect at least one subregion described in the modification the size or
Including it is described storage distribution in number of partitions in described at least one.
Claims (15)
1. a kind of computer implemented method comprising:
Monitor that the data in the storage distribution in networked environment use, the storage distribution has is permitted including at least one subregion
Multi partition;
Determination includes whether the data at least one subregion described in described store in distribution use more than specified threshold
Value;
Modify the size of at least one subregion;
Network flow associated with the storage distribution is guided in separate storage distribution and is repaired with described in the size
Change associated part;And
When the modification is completed, the network flow is directed to associated with the modification described in the storage distribution
Part.
2. computer implemented method as described in claim 1, further include:
The size for detecting the search index for the storage distribution is more than specified index size threshold value;And
The described search index for the storage distribution is updated based on the size for modifying at least one subregion.
3. computer implemented method as claimed in claim 2, wherein updating described search index includes rebuilding to deposit for described
The described search of storage distribution indexes the size to reflect at least one subregion described in the modification.
4. computer implemented method as described in claim 1, wherein it includes being stored in the storage point that the data, which use,
The data bulk mixed or at least one of the rate in the upper operation data of storage distribution.
5. computer implemented method as claimed in claim 4, wherein the specified threshold includes specified amount threshold or refers to
At least one of fixed rate-valve value, and be more than wherein the specified amount threshold when stored data quantity occurs
Or the rate of operation data, when being more than at least one in the specified rate-valve value, the data use more than described specified
Threshold value.
6. computer implemented method as claimed in claim 5 is wherein at least based partially on the information used about historical data
To calculate the specified threshold.
7. computer implemented method as described in claim 1, further include:
Determine that the network flow quantity for being directed into the storage distribution is higher than specified flow threshold;And
The storage distribution is modified based on the network flow quantity.
8. computer implemented method as claimed in claim 7, wherein the network flow is described including being stored in for search
The search inquiry flow of data in storage distribution.
9. computer implemented method as claimed in claim 7, wherein modifying the storage based on the network flow quantity
Distribution includes at least one of the following: modifying the size of at least one subregion;Modify number of partitions;Or with having
At least one subregion replacement of different size includes at least one subregion in number of partitions.
10. computer implemented method as claimed in claim 9, wherein the different size includes different cpu powers, difference
RAM capacity, different at least one of hard drive space capacity or different bandwidth capacity.
11. computer implemented method as described in claim 1, if wherein the size for modifying at least one subregion includes
The size of at least one subregion is lower than maximum partition size threshold, increases the size of at least one subregion.
12. computer implemented method as described in claim 1, wherein the size for modifying at least one subregion includes reducing
The size of at least one subregion, and wherein if number of partitions is a subregion, execute to it is described at least one
The reduction of the size of subregion.
13. computer implemented method as described in claim 1, further include:
Determine that the CPU of the storage distribution is used, wherein modifying the size is based on the data in the storage distribution
Using or the identified CPU of the storage distribution at least one of use.
14. computer implemented method as described in claim 1, further include:
Based on modifying the storage with the data using at least one of associated configuration or Client-initiated input
The configuration of distribution.
15. computer implemented method as described in claim 1, further include:
Obtainable resource is distributed based on the storage to determine when to execute the modification to the size.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811424497.4A CN110096502A (en) | 2012-10-12 | 2013-10-12 | Implementation method, system and the medium of the index configurations that can search for data in network |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/650,961 | 2012-10-12 | ||
US13/650,931 US9507750B2 (en) | 2012-10-12 | 2012-10-12 | Dynamic search partitioning |
US13/650,961 US9047326B2 (en) | 2012-10-12 | 2012-10-12 | Index configuration for searchable data in network |
US13/650,931 | 2012-10-12 | ||
PCT/US2013/064731 WO2014059394A1 (en) | 2012-10-12 | 2013-10-12 | Index configuration for searchable data in network |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811424497.4A Division CN110096502A (en) | 2012-10-12 | 2013-10-12 | Implementation method, system and the medium of the index configurations that can search for data in network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104823169A CN104823169A (en) | 2015-08-05 |
CN104823169B true CN104823169B (en) | 2018-12-21 |
Family
ID=50477970
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811424497.4A Pending CN110096502A (en) | 2012-10-12 | 2013-10-12 | Implementation method, system and the medium of the index configurations that can search for data in network |
CN201380053433.7A Active CN104823169B (en) | 2012-10-12 | 2013-10-12 | For the index configurations that can search for data in network |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811424497.4A Pending CN110096502A (en) | 2012-10-12 | 2013-10-12 | Implementation method, system and the medium of the index configurations that can search for data in network |
Country Status (10)
Country | Link |
---|---|
EP (1) | EP2907034A4 (en) |
JP (2) | JP2015532493A (en) |
KR (2) | KR101737246B1 (en) |
CN (2) | CN110096502A (en) |
AU (3) | AU2013328901B2 (en) |
BR (1) | BR112015008146A2 (en) |
CA (1) | CA2888116C (en) |
IN (1) | IN2015DN03160A (en) |
SG (2) | SG11201502828PA (en) |
WO (1) | WO2014059394A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10289603B2 (en) | 2012-10-12 | 2019-05-14 | Amazon Technologies, Inc. | Dynamic search partitioning |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9047326B2 (en) | 2012-10-12 | 2015-06-02 | A9.Com, Inc. | Index configuration for searchable data in network |
CN106131189A (en) * | 2016-07-15 | 2016-11-16 | 柳州健科技有限公司 | The network platform based on LAN |
CN106131188A (en) * | 2016-07-15 | 2016-11-16 | 柳州健科技有限公司 | LAN system |
CN105978913A (en) * | 2016-07-15 | 2016-09-28 | 柳州健科技有限公司 | Network service system |
CN105979016A (en) * | 2016-07-15 | 2016-09-28 | 柳州健科技有限公司 | Local area network data service system |
CN105978739A (en) * | 2016-07-15 | 2016-09-28 | 柳州健科技有限公司 | Network data platform based on local area network |
CN105979015A (en) * | 2016-07-15 | 2016-09-28 | 柳州健科技有限公司 | Network data service platform based on local area network |
CN105979014A (en) * | 2016-07-15 | 2016-09-28 | 柳州健科技有限公司 | Network data system based on local area network |
CN106101024A (en) * | 2016-07-16 | 2016-11-09 | 柳州健科技有限公司 | There is the LAN data system of data monitoring function |
CN106131196A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | The network system with self-learning function based on LAN |
CN106131190A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | The network platform with data monitoring function based on LAN |
CN106131191A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | There is the LAN data service system of data monitoring function |
CN106060082A (en) * | 2016-07-16 | 2016-10-26 | 柳州健科技有限公司 | Local area network-based network service platform with data monitoring function |
CN106131193A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | There is the local area network services platform of self-learning function |
CN106060081A (en) * | 2016-07-16 | 2016-10-26 | 柳州健科技有限公司 | Network service platform with data monitor function |
CN106131195A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | There is the LAN system of data monitoring function |
CN106131192A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | The network system with data monitoring function based on LAN |
CN106131194A (en) * | 2016-07-16 | 2016-11-16 | 柳州健科技有限公司 | There is the LAN platform of self-learning function |
CN106060083A (en) * | 2016-07-16 | 2016-10-26 | 柳州健科技有限公司 | Network service system with data monitoring function |
CN107977381B (en) * | 2016-10-24 | 2021-08-27 | 华为技术有限公司 | Data configuration method, index management method, related device and computing equipment |
CN110019191A (en) * | 2017-09-21 | 2019-07-16 | 阿里巴巴集团控股有限公司 | Database information processing method and processing device |
CN108881147B (en) * | 2017-12-29 | 2019-07-05 | 视联动力信息技术股份有限公司 | A kind of data processing method and device of view networking |
CN110134661A (en) * | 2019-05-22 | 2019-08-16 | 东北大学 | A kind of academic big data storage querying method towards facet |
CN112306604B (en) * | 2020-08-21 | 2022-09-23 | 海信视像科技股份有限公司 | Progress display method and display device for file transmission |
US11658917B2 (en) | 2021-04-09 | 2023-05-23 | Tekion Corp | Selective offloading of bandwidth to enable large-scale data indexing |
CN117596176B (en) * | 2024-01-17 | 2024-04-19 | 苏州元脑智能科技有限公司 | Network state measuring method, device, equipment and storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7788233B1 (en) * | 2007-07-05 | 2010-08-31 | Amazon Technologies, Inc. | Data store replication for entity based partition |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1143349A1 (en) * | 2000-04-07 | 2001-10-10 | IconParc GmbH | Method and apparatus for generating index data for search engines |
US7716168B2 (en) * | 2005-06-29 | 2010-05-11 | Microsoft Corporation | Modifying table definitions within a database application |
US7457910B2 (en) * | 2005-06-29 | 2008-11-25 | Sandisk Corproation | Method and system for managing partitions in a storage device |
US8341345B2 (en) * | 2005-08-08 | 2012-12-25 | International Business Machines Corporation | System and method for providing content based anticipative storage management |
US7668825B2 (en) * | 2005-08-26 | 2010-02-23 | Convera Corporation | Search system and method |
JP4772569B2 (en) * | 2006-04-07 | 2011-09-14 | 株式会社日立製作所 | System and method for performing directory unit migration in a common namespace |
US8214345B2 (en) * | 2006-10-05 | 2012-07-03 | International Business Machines Corporation | Custom constraints for faceted exploration |
JP5218060B2 (en) * | 2006-10-06 | 2013-06-26 | 日本電気株式会社 | Information retrieval system, information retrieval method and program |
US8965807B1 (en) * | 2007-05-21 | 2015-02-24 | Amazon Technologies, Inc. | Selecting and providing items in a media consumption system |
US20100011368A1 (en) * | 2008-07-09 | 2010-01-14 | Hiroshi Arakawa | Methods, systems and programs for partitioned storage resources and services in dynamically reorganized storage platforms |
JP4762289B2 (en) * | 2008-10-01 | 2011-08-31 | 株式会社日立製作所 | A storage system that controls allocation of storage areas to virtual volumes that store specific pattern data |
US9996572B2 (en) * | 2008-10-24 | 2018-06-12 | Microsoft Technology Licensing, Llc | Partition management in a partitioned, scalable, and available structured storage |
CN102405460B (en) * | 2009-02-11 | 2016-05-04 | 艾梵尼达有限公司 | Virtual storage system and operation method thereof |
US8250026B2 (en) * | 2009-03-06 | 2012-08-21 | Peoplechart Corporation | Combining medical information captured in structured and unstructured data formats for use or display in a user application, interface, or view |
US20110131202A1 (en) * | 2009-12-02 | 2011-06-02 | International Business Machines Corporation | Exploration of item consumption by customers |
US8930332B2 (en) * | 2010-03-12 | 2015-01-06 | Salesforce.Com, Inc. | Method and system for partitioning search indexes |
WO2011118427A1 (en) * | 2010-03-24 | 2011-09-29 | 日本電気株式会社 | Query device, query division method, and program for query division |
US8190593B1 (en) * | 2010-04-14 | 2012-05-29 | A9.Com, Inc. | Dynamic request throttling |
WO2012020471A1 (en) * | 2010-08-10 | 2012-02-16 | 株式会社日立製作所 | Computer system management method and management system |
WO2012072879A1 (en) * | 2010-11-30 | 2012-06-07 | Nokia Corporation | Method and apparatus for updating a partitioned index |
US8495331B2 (en) * | 2010-12-22 | 2013-07-23 | Hitachi, Ltd. | Storage apparatus and storage management method for storing entries in management tables |
US8620897B2 (en) * | 2011-03-11 | 2013-12-31 | Microsoft Corporation | Indexing and searching features including using reusable index fields |
-
2013
- 2013-10-12 BR BR112015008146A patent/BR112015008146A2/en not_active Application Discontinuation
- 2013-10-12 IN IN3160DEN2015 patent/IN2015DN03160A/en unknown
- 2013-10-12 JP JP2015536973A patent/JP2015532493A/en active Pending
- 2013-10-12 SG SG11201502828PA patent/SG11201502828PA/en unknown
- 2013-10-12 CN CN201811424497.4A patent/CN110096502A/en active Pending
- 2013-10-12 CN CN201380053433.7A patent/CN104823169B/en active Active
- 2013-10-12 KR KR1020157012030A patent/KR101737246B1/en active IP Right Grant
- 2013-10-12 AU AU2013328901A patent/AU2013328901B2/en not_active Ceased
- 2013-10-12 EP EP13845541.5A patent/EP2907034A4/en not_active Withdrawn
- 2013-10-12 WO PCT/US2013/064731 patent/WO2014059394A1/en active Application Filing
- 2013-10-12 SG SG10201606363SA patent/SG10201606363SA/en unknown
- 2013-10-12 KR KR1020177012552A patent/KR101782302B1/en active IP Right Grant
- 2013-10-12 CA CA2888116A patent/CA2888116C/en active Active
-
2016
- 2016-09-20 AU AU2016231488A patent/AU2016231488B2/en not_active Ceased
- 2016-10-28 JP JP2016211567A patent/JP6339155B2/en active Active
-
2017
- 2017-10-12 AU AU2017245374A patent/AU2017245374B2/en not_active Ceased
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7788233B1 (en) * | 2007-07-05 | 2010-08-31 | Amazon Technologies, Inc. | Data store replication for entity based partition |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10289603B2 (en) | 2012-10-12 | 2019-05-14 | Amazon Technologies, Inc. | Dynamic search partitioning |
Also Published As
Publication number | Publication date |
---|---|
CA2888116A1 (en) | 2014-04-17 |
AU2017245374A1 (en) | 2018-01-18 |
IN2015DN03160A (en) | 2015-10-02 |
CA2888116C (en) | 2018-03-27 |
AU2016231488B2 (en) | 2017-09-21 |
KR101782302B1 (en) | 2017-09-26 |
AU2013328901A1 (en) | 2015-05-14 |
KR20150066575A (en) | 2015-06-16 |
KR101737246B1 (en) | 2017-05-17 |
JP6339155B2 (en) | 2018-06-06 |
KR20170054579A (en) | 2017-05-17 |
WO2014059394A1 (en) | 2014-04-17 |
CN104823169A (en) | 2015-08-05 |
JP2017050012A (en) | 2017-03-09 |
AU2013328901B2 (en) | 2016-07-28 |
BR112015008146A2 (en) | 2017-07-04 |
EP2907034A1 (en) | 2015-08-19 |
AU2016231488A1 (en) | 2016-10-06 |
EP2907034A4 (en) | 2016-05-18 |
CN110096502A (en) | 2019-08-06 |
AU2017245374B2 (en) | 2018-08-09 |
SG11201502828PA (en) | 2015-05-28 |
JP2015532493A (en) | 2015-11-09 |
SG10201606363SA (en) | 2016-09-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104823169B (en) | For the index configurations that can search for data in network | |
US11595477B2 (en) | Cloud storage methods and systems | |
US9922360B2 (en) | Recommendations system | |
CN104704522B (en) | Recommend native applications | |
US9372901B2 (en) | Searching for software applications based on application attributes | |
US9411839B2 (en) | Index configuration for searchable data in network | |
US10135940B2 (en) | Subscribing to event notifications using object instances | |
US9223902B1 (en) | Architectures for content identification | |
KR20160138261A (en) | Infrastructure for synchronization of mobile device with mobile cloud service | |
US10848434B2 (en) | Performance management for query processing | |
KR20160119185A (en) | Cloud service custom execution environment | |
US8645554B2 (en) | Method and apparatus for identifying network functions based on user data | |
US11882154B2 (en) | Template representation of security resources | |
US10075508B2 (en) | Application-centric socialization | |
AU2019284230A1 (en) | Multi-source data analytics system, data manager and related methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |