Multi-user multi-language mail translation preprocessing system and method
Technical Field
The invention relates to a multi-language mail translation processing system, in particular to a multi-user multi-language mail translation preprocessing system and a multi-user multi-language mail translation preprocessing method.
Background
Email is an important means of communication today. With globalization and increasingly frequent contact between people of different countries, people are more and more often involved in cross-border email communication in their life and work, and a need for multilingual email translation has therefore arisen. However, it is well known that large volumes of spam are a nuisance to email users. In addition, some mail does not need to be translated at all: only after translating it does the translator find that the mail is not wanted by the customer, or that it has already been translated. Translating such mail wastes time, material and human resources. It is therefore desirable to provide a multilingual mail translation preprocessing system.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a multi-user multi-language mail translation preprocessing system that overcomes the waste of translation time, materials and manpower in existing mail translation work.
In order to solve the technical problem, the multi-user multi-language mail translation preprocessing system comprises a preprocessing server system and a preprocessing client system, wherein the preprocessing server system is deployed on a server, and the preprocessing client system is deployed on a personal machine;
the preprocessing server system comprises: a mail receiving module, a filtering processing module, a mail distribution module, a first storage area and a second storage area; the first storage area is used for storing a first data table, the first data table comprises a plurality of data records, and each record comprises at least information fields for a recipient account, a translator account and a language to be translated;
the mail receiving module is used for receiving external mails;
the filtering processing module is used for reading the external mails one by one and acquiring the language of each mail and the account information of its recipient; it then queries the first data table, finds the record that matches both the recipient account and the mail language, and obtains from that record the account information of the translator to whom the mail is to be distributed;
the filtering processing module is further used, while reading the external mails one by one, for obtaining the MD5 value of the title and the MD5 value of the content of each external mail, and for storing the title MD5 value, the content MD5 value and the recipient account information as one record in the second storage area; when the next external mail is read, the title MD5 value, the content MD5 value and the recipient account information of the current mail are matched one by one against the records in the second storage area, and the current mail is marked as a repeated mail if an identical record exists; otherwise, the mail and the account information of the translator to whom it is to be distributed are sent to the mail distribution module;
the mail distribution module is used for receiving, one by one, the external mails sent by the filtering processing module together with the corresponding translator account information, and for forwarding each external mail to the preprocessing client system on which the assigned translator account is logged in;
and the preprocessing client system is used for the translator to log in and, after login, for receiving the external mail forwarded by the mail distribution module.
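By way of illustration only, the repeated-mail check performed by the filtering processing module may be sketched as follows. The sketch assumes an in-memory set standing in for the second storage area; the names `md5_hex` and `DuplicateStore` and the sample addresses are hypothetical and do not appear in the specification.

```python
import hashlib


def md5_hex(text: str) -> str:
    """Return the hexadecimal MD5 digest of a UTF-8 encoded string."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


class DuplicateStore:
    """In-memory stand-in for the second storage area (hypothetical name)."""

    def __init__(self) -> None:
        # Each record is (title MD5, content MD5, recipient account).
        self._records = set()

    def is_duplicate(self, title: str, content: str, recipient: str) -> bool:
        """Return True if an identical record exists; otherwise store the record."""
        record = (md5_hex(title), md5_hex(content), recipient)
        if record in self._records:
            return True
        self._records.add(record)
        return False


store = DuplicateStore()
first = store.is_duplicate("Quotation request", "Please send prices.", "sales@example.com")
second = store.is_duplicate("Quotation request", "Please send prices.", "sales@example.com")
other = store.is_duplicate("Quotation request", "Please send prices.", "support@example.com")
```

Here `first` is False because nothing has been stored yet, `second` is True because title, content and recipient all match the stored record, and `other` is False because the recipient account differs.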
In a further optimized scheme, the multi-user multi-language mail translation preprocessing system also comprises a filtering rule database, wherein the filtering rule database comprises records of a plurality of mail filtering rules; each mail filtering rule record comprises: a matching object, a matching rule, a matching priority and a mail marking type, wherein the matching objects comprise the mail sender, the mail recipient and the mail title;
after the filtering processing module reads an external mail, it first acquires the sender, the recipient and the title of the mail, then reads the filtering rule database and, taking the sender, the recipient and the title of the mail as matching objects, searches the filtering rule database according to the matching rule of each mail filtering rule record. If two or more matching rule records are found, the filtering rule with the highest matching priority is selected as the target filtering rule; if only one matching rule record is found, that filtering rule is taken as the target filtering rule. The mail marking type contained in the target filtering rule is then read, the external mail is marked accordingly, and the next external mail is read. If no matching rule record is found, subsequent processing is performed.
Furthermore, when the filtering processing module obtains the language of a mail, it calls a language judgment module; the language judgment module judges the language of the mail by combining the title and the content of the mail and returns the result to the filtering processing module.
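As a non-limiting sketch, the selection of the target filtering rule may be expressed as follows. It assumes that a larger number means a higher matching priority and that each rule carries its matching rule as a callable; the class names, the sample rules and the addresses are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Mail:
    sender: str
    recipient: str
    title: str


@dataclass
class FilterRule:
    matches: Callable[[Mail], bool]  # matching rule applied to the matching objects
    priority: int                    # assumption: a larger number is a higher priority
    mark_type: str                   # mail marking type


def select_target_rule(mail: Mail, rules: List[FilterRule]) -> Optional[FilterRule]:
    """Return the matching rule with the highest priority, or None if none match."""
    matched = [rule for rule in rules if rule.matches(mail)]
    if not matched:
        return None
    return max(matched, key=lambda rule: rule.priority)


rules = [
    FilterRule(lambda m: m.sender == "ads@promo.example", 1, "advertisement mail"),
    FilterRule(lambda m: "exhibition" in m.title.lower(), 5, "exhibition mail"),
]
mail = Mail("ads@promo.example", "sales@example.com", "Spring exhibition invitation")
target = select_target_rule(mail, rules)
# None would mean: no rule matched, continue with subsequent processing.
mark = target.mark_type if target else None
```

Both sample rules match the sample mail, so the rule with priority 5 wins and `mark` is "exhibition mail".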
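The specification does not state how the language judgment module decides the language. Purely as an assumed toy heuristic, the following sketch counts characters by Unicode script over the title and content taken together; the function name `detect_language`, the language codes and the treatment of all Latin letters as "en" are assumptions, and a practical system would likely rely on a dedicated language identification component.

```python
def detect_language(title: str, content: str) -> str:
    """Guess a language code from the scripts used in title and content combined."""
    text = f"{title}\n{content}"
    counts = {"zh": 0, "ja": 0, "ko": 0, "ru": 0, "en": 0}
    for ch in text:
        code = ord(ch)
        if 0x3040 <= code <= 0x30FF:       # Hiragana and Katakana
            counts["ja"] += 1
        elif 0x4E00 <= code <= 0x9FFF:     # CJK unified ideographs
            counts["zh"] += 1
        elif 0xAC00 <= code <= 0xD7AF:     # Hangul syllables
            counts["ko"] += 1
        elif 0x0400 <= code <= 0x04FF:     # Cyrillic
            counts["ru"] += 1
        elif ch.isascii() and ch.isalpha():
            counts["en"] += 1              # gross simplification: any Latin letter
    if counts["ja"] > 0:
        # Kana only occur in Japanese, even when ideographs dominate the text.
        return "ja"
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else "unknown"


lang_a = detect_language("Re: 报价", "请发送最新的价格表。")
lang_b = detect_language("Hello", "Please send the price list.")
```

Combining title and content matters for mails such as the first sample, whose title starts with the Latin prefix "Re:" while the body is Chinese.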
Preferably, the record fields in the second storage area further include time information, and when the filtering processing module matches the title MD5 value, the content MD5 value and the recipient account information of the current mail one by one against the records in the second storage area, it matches only records that fall within a set time period.
Preferably, the system further comprises a cache cleaning module, the record fields in the second storage area further include time information, and the cache cleaning module is configured to periodically delete, according to the time information field, the records in the second storage area that are older than a set time.
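One possible reading of the time-limited matching is sketched below. It assumes that the time information is the moment the record was stored, that the set time is a sliding window ending at the current mail, and that a repeated mail does not itself create a new record; these choices and the name `TimedDuplicateStore` are assumptions.

```python
import hashlib
from datetime import datetime, timedelta


def md5_hex(text: str) -> str:
    return hashlib.md5(text.encode("utf-8")).hexdigest()


class TimedDuplicateStore:
    """Second storage area whose records carry a timestamp (hypothetical name)."""

    def __init__(self, window: timedelta) -> None:
        self.window = window
        self._records = []  # tuples of (title MD5, content MD5, recipient, stored_at)

    def is_duplicate(self, title: str, content: str, recipient: str, now: datetime) -> bool:
        key = (md5_hex(title), md5_hex(content), recipient)
        cutoff = now - self.window
        # Only records stored inside the window take part in the match.
        duplicate = any(rec[:3] == key and rec[3] >= cutoff for rec in self._records)
        if not duplicate:
            self._records.append(key + (now,))
        return duplicate


t0 = datetime(2024, 1, 1, 9, 0)
store = TimedDuplicateStore(timedelta(hours=24))
r1 = store.is_duplicate("T", "C", "sales@example.com", t0)
r2 = store.is_duplicate("T", "C", "sales@example.com", t0 + timedelta(hours=1))
r3 = store.is_duplicate("T", "C", "sales@example.com", t0 + timedelta(hours=25))
```

With a 24-hour window, `r2` is True because the first record is one hour old, while `r3` is False because that record is by then 25 hours old and is ignored.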
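A minimal sketch of one pass of the cache cleaning module follows, assuming records are held as dictionaries with a `stored_at` timestamp; the function name, the field names and the placeholder hash strings are hypothetical. How the pass is scheduled periodically (timer, cron job, or similar) is left open here.

```python
from datetime import datetime, timedelta


def purge_expired(records: list, now: datetime, retention: timedelta) -> int:
    """Delete records older than `retention` in place and return how many were removed."""
    cutoff = now - retention
    kept = [rec for rec in records if rec["stored_at"] >= cutoff]
    removed = len(records) - len(kept)
    records[:] = kept  # mutate the caller's list so the storage area itself shrinks
    return removed


now = datetime(2024, 1, 10, 0, 0)
second_storage = [
    {"title_md5": "aaa", "content_md5": "bbb", "recipient": "sales@example.com",
     "stored_at": now - timedelta(days=9)},
    {"title_md5": "ccc", "content_md5": "ddd", "recipient": "sales@example.com",
     "stored_at": now - timedelta(days=1)},
]
removed = purge_expired(second_storage, now, retention=timedelta(days=7))
```

After the pass only the one-day-old record remains, so a later full scan of the second storage area touches fewer records.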
The invention also provides a processing method of the multi-user multi-language mail translation preprocessing system, which comprises the following steps:
receiving external mails;
reading the external mails one by one to obtain the language of each mail and the account information of its recipient; then querying the first data table, finding the record that matches both the recipient account and the mail language, and obtaining from that record the account information of the translator to whom the mail is to be distributed;
while the external mails are read one by one, obtaining the MD5 value of the title and the MD5 value of the content of each external mail, and storing the title MD5 value, the content MD5 value and the recipient account information as one record in the second storage area; when the next external mail is read, matching the title MD5 value, the content MD5 value and the recipient account information of the current mail one by one against the records in the second storage area, and marking the current mail as a repeated mail if an identical record exists; otherwise, sending the mail and the account information of the translator to whom it is to be distributed to the mail distribution module;
receiving, one by one, the external mails sent by the filtering processing module together with the corresponding translator account information, and forwarding each external mail to the preprocessing client system on which the assigned translator account is logged in;
and the translator logging in to the preprocessing client system and, after login, receiving the external mail forwarded by the mail distribution module.
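The query against the first data table in the steps above can be illustrated with the following sketch. The table contents, the field names and the account names are hypothetical, and returning None when no record matches is an assumption, since the specification does not describe that case.

```python
from typing import Optional

# Each record: recipient account, language to be translated, translator account.
FIRST_DATA_TABLE = [
    {"recipient": "sales@example.com", "language": "zh", "translator": "translator_a"},
    {"recipient": "sales@example.com", "language": "ru", "translator": "translator_b"},
    {"recipient": "support@example.com", "language": "zh", "translator": "translator_c"},
]


def find_translator(table: list, recipient: str, language: str) -> Optional[str]:
    """Return the translator account whose record matches both recipient and language."""
    for record in table:
        if record["recipient"] == recipient and record["language"] == language:
            return record["translator"]
    return None  # assumption: no matching record means no translator is assigned


assigned = find_translator(FIRST_DATA_TABLE, "sales@example.com", "ru")
```

Because the lookup key is the pair of recipient account and language, the same recipient can be served by different translators for different languages, which is what makes the system multi-user and multi-language.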
In a more optimized scheme, the server system is also provided with a filtering rule database. After the filtering processing module reads an external mail, it first acquires the sender, the recipient and the title of the mail, then reads the filtering rule database and, taking the sender, the recipient and the title of the mail as matching objects, searches the filtering rule database according to the matching rule of each mail filtering rule record. If two or more matching rule records are found, the filtering rule with the highest matching priority is selected as the target filtering rule; if only one matching rule record is found, that filtering rule is taken as the target filtering rule. The mail marking type contained in the target filtering rule is then read, the external mail is marked accordingly, and the next external mail is read. If no matching rule record is found, subsequent processing is performed.
When the language of the mail is obtained, it is judged by combining the title and the content of the mail.
After the translator logs in to the preprocessing client system, the preprocessing client system provides the translator with a working interface for receiving and translating mails.
Preferably, the record fields in the second storage area further include time information, and when the title MD5 value, the content MD5 value and the recipient account information of the current mail are matched one by one against the records in the second storage area, only records that fall within a set time period are matched.
Preferably, the record fields in the second storage area further include time information, and the records in the second storage area that are older than a set time are periodically deleted according to the time information field.
Advantageous Effects
Through several checkpoints, namely the filtering rules, the repeated-mail judgment and the language filtering, the invention distributes external mails accurately and in a targeted manner to the corresponding translators, shields translators from repeated and invalid mails, and greatly improves the working efficiency of the translators.
Drawings
The technical solution of the present invention will be further specifically described with reference to the accompanying drawings and the detailed description.
Fig. 1 is a block diagram of the structure of an embodiment of the present invention.
Fig. 2 is a process flow diagram of a pre-processing server system according to an embodiment of the present invention.
Fig. 3 is a process flow diagram of a preprocessing client system according to an embodiment of the present invention.
Fig. 4 is a diagram illustrating a filtering rule record customized in the filtering rule database.
Detailed Description
For a mail translation system, correctly identifying the mail that a user needs translated, while greatly reducing the translator resources wasted on mail that does not need translation, is the underlying object of the present invention.
As shown in Fig. 1, the structure of the present invention is as follows:
the system comprises: a mail receiving module, a filtering processing module, a mail distribution module, a first storage area, a second storage area and a filtering rule database;
the first storage area stores a first data table, the first data table comprises a plurality of data records, and each record comprises at least information fields for a recipient account, a translator account and a language to be translated;
the filtering rule database comprises records of a plurality of mail filtering rules; each mail filtering rule record comprises: a matching object, a matching rule, a matching priority and a mail marking type, wherein the matching objects comprise the mail sender, the mail recipient and the mail title; the filtering rules are customized and maintained by an administrator according to empirical data.
The mail receiving module is used for receiving external mails;
the filtering processing module is used for reading the external mails one by one, filtering out mails which do not need to be translated and repeated mails according to a filtering rule, and finding out a translator account corresponding to the mails which need to be translated;
specifically: after reading an external mail, the filtering processing module first acquires the sender, the recipient and the title of the mail, then reads the filtering rule database and, taking the sender, the recipient and the title of the mail as matching objects, searches the filtering rule database according to the matching rule of each mail filtering rule record; if two or more matching rule records are found, the filtering rule with the highest matching priority is selected as the target filtering rule; if only one matching rule record is found, that filtering rule is taken as the target filtering rule; the mail marking type contained in the target filtering rule is read, the external mail is marked accordingly, and the next external mail is read; if no matching rule record is found, subsequent processing is carried out;
in the subsequent processing, the language of the external mail and the account information of its recipient are acquired; the first data table is then queried, the record that matches both the recipient account and the mail language is found, and the account information of the translator to whom the mail is to be distributed is obtained from that record; while reading the external mails one by one, the filtering processing module also obtains the MD5 value of the title and the MD5 value of the content of each external mail, and stores the title MD5 value, the content MD5 value and the recipient account information as one record in the second storage area; when the next external mail is read, the title MD5 value, the content MD5 value and the recipient account information of the current mail are matched one by one against the records in the second storage area, and the current mail is marked as a repeated mail if an identical record exists; otherwise, the mail and the translator account information obtained from the record are sent to the mail distribution module.
The mail distribution module is used for receiving, one by one, the external mails sent by the filtering processing module together with the corresponding translator account information, and for forwarding each external mail to the preprocessing client system on which the corresponding translator account is logged in;
and the preprocessing client system is used for the translator to log in and, after login, for receiving the external mail forwarded by the mail distribution module.
The invention comprises a preprocessing server system and a preprocessing client system.
First, as shown in Fig. 2, the processing flow of the preprocessing server system is as follows:
receiving external mails;
after the filtering processing module reads an external mail, the sender, the recipient and the title of the mail are first obtained, and then the filtering rule database is read. The filtering rule database comprises records of a plurality of mail filtering rules; one mail filtering rule record, as shown in Fig. 4, includes: a rule name, an exclusion rule, a matching object, a matching rule, a matching priority and a system label, i.e. the mail marking type. The matching objects comprise the mail sender, the mail recipient and the mail title. The exclusion rule field takes the value "yes" or "no": "yes" indicates that the rule is applicable, and "no" indicates that the rule is invalid and is no longer applied. The matching rules are as follows: sender, exact match; recipient, exact match; title, fuzzy match.
According to the matching rules, the sender, the recipient and the title of the mail are taken as matching objects and a matching search is carried out in the filtering rule database; if two or more matching rule records are found, the filtering rule with the highest matching priority is selected as the target filtering rule; if only one matching rule record is found, that filtering rule is taken as the target filtering rule;
in a more preferred embodiment, before rule matching, the filtering rule database called by the system switching area is sorted with the matching priority as the index, with records ordered from highest to lowest priority. When a matching search is then carried out in the filtering rule database with the sender, the recipient and the title of the mail as matching objects, the first matching filtering rule record found is the target filtering rule, because every later match necessarily has an equal or lower priority. This increases the speed of rule filtering. The mail marking type contained in the target filtering rule is then read; marking types include, for example, marking mail from a specific sender as advertisement mail, customer inquiry mail, exhibition mail or system mail. Once an external mail has been matched and marked, it is filtered out and is not distributed to a translator for processing. The next external mail is then read and processed in the same way. If no matching rule record is found, the subsequent processing is carried out as follows:
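The priority-sorted search of this preferred embodiment might look like the sketch below. The record fields loosely follow the Fig. 4 description; treating the yes/no field as an enabled flag, treating fuzzy title matching as a case-insensitive substring test, and treating a larger number as a higher priority are all assumptions, as are the class name and sample data.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class RuleRecord:
    name: str
    enabled: bool       # the yes/no field: False means the rule is no longer applied
    match_object: str   # "sender", "recipient" or "title"
    pattern: str
    priority: int       # assumption: a larger number is a higher priority
    system_label: str   # mail marking type


def rule_matches(rule: RuleRecord, sender: str, recipient: str, title: str) -> bool:
    if not rule.enabled:
        return False
    if rule.match_object == "sender":
        return sender == rule.pattern                 # exact match
    if rule.match_object == "recipient":
        return recipient == rule.pattern              # exact match
    if rule.match_object == "title":
        return rule.pattern.lower() in title.lower()  # assumed form of fuzzy match
    return False


def first_match(sorted_rules: List[RuleRecord], sender: str, recipient: str,
                title: str) -> Optional[RuleRecord]:
    """Scan rules already sorted from high to low priority and stop at the first hit."""
    for rule in sorted_rules:
        if rule_matches(rule, sender, recipient, title):
            return rule
    return None


rules = sorted(
    [
        RuleRecord("promo sender", True, "sender", "ads@promo.example", 1, "advertisement mail"),
        RuleRecord("exhibition title", True, "title", "exhibition", 5, "exhibition mail"),
        RuleRecord("old system rule", False, "title", "invitation", 9, "system mail"),
    ],
    key=lambda r: r.priority,
    reverse=True,  # sort once, before any matching takes place
)
hit = first_match(rules, "ads@promo.example", "sales@example.com", "Spring Exhibition Invitation")
label = hit.system_label if hit else None
```

The disabled rule is skipped despite its high priority, so the scan stops at the exhibition rule and never evaluates the lower-priority sender rule. Sorting once up front avoids collecting all matches and comparing their priorities for every mail.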
the language of the external mail and the account information of its recipient are acquired; the first data table is then queried, the record that matches both the recipient account and the mail language is found, and the account information of the translator to whom the mail is to be distributed is obtained from that record; while reading the external mails one by one, the filtering processing module also obtains the MD5 value of the title and the MD5 value of the content of each external mail, and stores the title MD5 value, the content MD5 value and the recipient account information as one record in the second storage area; when the next external mail is read, the title MD5 value, the content MD5 value and the recipient account information of the current mail are matched one by one against the records in the second storage area, and the current mail is marked as a repeated mail if an identical record exists; otherwise, the mail and the translator account information obtained from the record are sent to the mail distribution module.
The record fields in the second storage area also include time information, and when the title MD5 value, the content MD5 value and the recipient account information of the current mail are matched one by one against the records in the second storage area, only records that fall within a set time period are matched.
As an alternative, the records in the second storage area that are older than the set time are periodically deleted according to the time information field. In that case, when the title MD5 value, the content MD5 value and the recipient account information of the current mail are matched one by one against the records in the second storage area, all remaining records are matched.
The mail distribution module receives, one by one, the external mails sent by the filtering processing module together with the corresponding translator account information, and distributes each external mail to the preprocessing client system on which the corresponding translator account is logged in.
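The order of the server-side checks described above can be summarized in the following sketch, in which each check is passed in as a callable so that the sequence itself is visible. The function `preprocess`, the result dictionaries and the demonstration stubs are hypothetical; in particular, the stub duplicate check compares raw strings rather than MD5 values to keep the example short.

```python
from typing import Callable, Optional


def preprocess(
    mail: dict,
    match_rule: Callable[[dict], Optional[str]],
    detect_language: Callable[[str, str], str],
    find_translator: Callable[[str, str], Optional[str]],
    is_duplicate: Callable[[dict], bool],
) -> dict:
    # Checkpoint 1: rule filtering; a marked mail is not processed further.
    label = match_rule(mail)
    if label is not None:
        return {"action": "mark", "label": label}
    # Checkpoint 2: language judgment and translator lookup in the first data table.
    language = detect_language(mail["title"], mail["content"])
    translator = find_translator(mail["recipient"], language)
    # Checkpoint 3: repeated-mail judgment against the second storage area.
    if is_duplicate(mail):
        return {"action": "mark", "label": "repeated mail"}
    return {"action": "distribute", "translator": translator, "language": language}


seen = set()


def demo_is_duplicate(mail: dict) -> bool:
    key = (mail["title"], mail["content"], mail["recipient"])
    if key in seen:
        return True
    seen.add(key)
    return False


def run(mail: dict) -> dict:
    return preprocess(
        mail,
        match_rule=lambda m: "advertisement mail" if m["sender"] == "ads@promo.example" else None,
        detect_language=lambda title, content: "zh",
        find_translator=lambda recipient, language: "translator_a",
        is_duplicate=demo_is_duplicate,
    )


inquiry = {"sender": "buyer@client.example", "recipient": "sales@example.com",
           "title": "询价", "content": "请报价。"}
advert = dict(inquiry, sender="ads@promo.example")
results = [run(advert), run(inquiry), run(inquiry)]
```

The three sample runs yield, in order, a mail marked by a rule, a mail handed to the distribution step, and the same mail marked as repeated on its second arrival.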
Secondly, as shown in Fig. 3, the translator logs in to the preprocessing client system and, after login, receives the external mail forwarded by the mail distribution module.
After the translator logs in to the preprocessing client system, the preprocessing client system provides the translator with a working interface for receiving and translating mails, in which the translator carries out the translation work.
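The interaction between the mail distribution module and a logged-in client can be sketched as below. Holding mail in a per-translator queue until that translator logs in is an assumption, since the specification only states that mail is received after login; the class `MailDistributor` and its methods are hypothetical.

```python
from collections import defaultdict, deque


class MailDistributor:
    """Toy mail distribution module with per-translator delivery (hypothetical)."""

    def __init__(self) -> None:
        self._pending = defaultdict(deque)  # translator account -> mails awaiting login
        self._inboxes = {}                  # translator account -> inbox of a logged-in client

    def forward(self, translator: str, mail: str) -> None:
        """Deliver immediately if the translator is logged in, otherwise queue the mail."""
        if translator in self._inboxes:
            self._inboxes[translator].append(mail)
        else:
            self._pending[translator].append(mail)

    def login(self, translator: str) -> list:
        """Register the client as logged in and hand over any queued mail."""
        inbox = self._inboxes.setdefault(translator, [])
        queue = self._pending[translator]
        while queue:
            inbox.append(queue.popleft())
        return inbox


distributor = MailDistributor()
distributor.forward("translator_a", "mail 1")  # translator not yet logged in: queued
inbox = distributor.login("translator_a")      # queued mail is delivered on login
distributor.forward("translator_a", "mail 2")  # delivered directly from now on
```

Keeping one queue per translator account means that mail assigned to one translator never appears in another translator's working interface.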
Therefore, through several checkpoints, namely early-stage rule filtering, repeated-mail judgment and language filtering, mails that do not need to be translated are generally delivered to the content processing module, the number of invalid mails that a human translator has to handle is greatly reduced, the efficiency of the translator is improved, and clients receive their processed mails in a timely manner.
Finally, it should be noted that the above-mentioned embodiments are only for illustrating the technical solutions of the present invention and not for limiting, and although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention, which should be covered by the claims of the present invention.