Modern computer translation systems. · Thirdly, computer dictionaries provide a quick search for dictionary entries: “quick typing”, when a list of similar words appears in the process of typing a word; access to frequently used words by bookmarks; perhaps

Home > Lesson

Machine translation systems. The current globalization of our world leads to the need to exchange documents between people and organizations located in different countries of the world and speaking different languages. Under these conditions, the use of traditional manual translation technology hinders the development of interethnic contacts. Manual translation of multi-page documentation requires a long time and high wages for translators. Translation of an e-mail received or a Web page viewed in a browser must be done immediately, and there is no opportunity and time to invite a translator. Machine translation systems solve these problems. On the one hand, they are capable of translating multi-page documents at high speed (one page per second) and, on the other hand, translating Web pages on the fly, in real time. PROMT and Socrates are considered the best among Russian machine translation systems. Machine translation systems translate texts based on the formal "knowledge" of the language (the syntax of the language - the rules for constructing sentences, the rules of word formation) and the use of dictionaries. The translator program first analyzes the text in one language and then constructs this text in another language. Modern machine translation systems make it possible to translate technical documentation, business correspondence and other specialized texts with sufficient quality. However, they are not applicable to the translation of works of art, since they are not capable of adequately translating metaphors, allegories and other elements of human artistic creativity. Questions:

    Why do we need programs - translators? How are computer dictionaries built? What texts should not be translated using computer translators?
III.Practical part. Now let's practice working with the computer-aided text translation program - Socrates Personal. This is a program from a series of software products developed in Russia and released by the Russian company ARSENAL. The Russian Office series of programs includes the well-known Lexicon text editor and other programs. Arsenal Company allows the work of an unregistered copy of Socrates Personal 4.0 in trial mode for 21 days. You can register the program on the company's website. Socrates - translation system from English into Russian and vice versa Automatically takes into account the subject of the text You can translate a document while maintaining all the formatting features. Socrates can be used in conjunction with the FineReader text recognition system to quickly capture documents from a scanner and translate them. The Socrates Internet version translates WWW pages directly in the browser. Wandering the Net, you can see both the original text of Web pages in English, German and French, as well as translation into Russian with the design preserved. The personal version of the Socrates system "understands" only English; and can also serve as an English-Russian and Russian-English dictionary. It is also very convenient for translating messages of built-in help systems (HELP) of English-language programs Socrates-Dictionary is an online translator's assistant equipped with two-way Russian-English and German-English dictionaries Can be additionally equipped with numerous dictionaries for translating from one European language to another Professional translation system Socrates - the most powerful product that allows you to use additional thematic dictionaries and create your own, expanding vocabulary. In addition to the basic English-Russian version, there is also a version with support for German and French. Run the program: Start→All Programs→Translators→Russian Office→Socrates Personal. Turn on Dictionary mode. With the program window open, just click on the Dictionary tab. The working area is divided into three parts: the dictionary base window, the input field, the translation window. You can get the translation of a word from the dictionary in the following ways: - enter the word in the input field. Moving through the vocabulary database is carried out as letters are entered, until the maximum possible match is obtained. - paste a word into the input field from the Clipboard. In this case, a quick transition will be made to the word that has the maximum possible match with the selected one. - select a previously translated word from the history window of the input field. In this case, a quick transition will be made to the word that has the maximum possible match with the entered one. - select a word in another application and hold down the Shift key and right-click on the selection. A pop-up window will display the translation of the selected word. - use a combination of Hot Keys, having previously placed the necessary word in the Clipboard. Now enable Translator mode. With the program window open, just click on the Translator tab. The Translator window is divided into 2 parts: the source window (top) and the translation window (bottom). Type (or copy) the text in the upper window of the Translator tab. To translate the entered text, click the Translate button on the toolbar or select the Translate command from the Translation menu. After that, the translation will appear in the bottom window of the bookmark. Translation results can be saved to a file or placed on the Clipboard. The program can also translate text from a file. To do this, select Open from the File menu in the program window. A file open dialog will appear. Open the file you need. The program will automatically perform the translation by placing the source text in the upper window of the Translator tab, and its translation in the lower one. In addition to the above features, the program provides translation of text from the clipboard, translation of text in help, quick translation of text in other applications ... You can learn how to use these features from the help system. To exit the program, give the command File→Exit. When you close the program with the close button, the program remains running and its icon is placed on the Windows Taskbar. Now open the file C:\Our Lesson\Lesson 25 Practice.doc and translate the words and text. Students complete the task. IV. D / s Know what automatic text translation programs are, be able to work with these programs. Additional task: connect to the Internet and use any on-line translator to translate the text. V. Students' questions. Answers to students' questions. VI.Summary of the lesson. Summing up the lesson. Grading. At the lesson, we got acquainted with programs for computer translation of texts, learned how to translate words and text using the Socrates Personal program. Lesson number 26. Topic: "Algorithms". Lesson Objectives:
    help students learn the concept of an algorithm, properties of algorithms, types of algorithms. education of information culture of students, attentiveness, accuracy, discipline, perseverance. development of algorithmic thinking, cognitive interests, computer skills.
Equipment: board, computer, computer presentation. Lesson plan:
    Org. moment. (1 min) Updating knowledge. (5 min) Theoretical part. (15 min) Practical part. (10 min) D / s (2 min) Questions from students. (5 min) Summary of the lesson. (2 minutes)
During the classes: I. Org. moment. Greeting, checking those present. Explanation of the lesson. II. Knowledge update. To solve most problems, there are many ready-made programs. But in order to better understand everything that happens with a computer and confidently make the right decisions, an ordinary user needs to have a certain computer literacy. It should be noted that most editors (for example, Microsoft Office Word, Excel) have built-in programming tools, having mastered which you can significantly expand your capabilities. III. Theoretical part. The emergence of algorithms is associated with the birth of mathematics. More than 1000 years ago (in 825), a scientist from the city of Khorezm Abdullah (or Abu Jafar) Muhammad bin Musa al-Khwarizmi created a book on mathematics, in which he described ways to perform arithmetic operations on multi-valued numbers. The very word algorithm originated in Europe after the translation of the book of this mathematician into Latin. Algorithm- description of the sequence of actions (plan), the strict execution of which leads to the solution of the task in a finite number of steps. You constantly come across this concept in various fields of human activity (cooking books, instructions for using various appliances, rules for solving mathematical problems ...). Usually we perform habitual actions without thinking, mechanically. For example, you know well how to open a door with a key. However, in order to teach this to the baby, you will have to clearly explain these actions themselves and the order in which they are performed: 1. Get the key out of your pocket. 2. Insert the key into the keyhole. 3. Turn the key two times counterclockwise. 4. Remove the key. If you look around carefully, you will find many algorithms that we are constantly executing. The world of algorithms is very diverse. Despite this, it is possible to highlight the general properties that any algorithm has. Properties of algorithms: 1. Discreteness (an algorithm must consist of specific actions following in a certain order); 2. Determinism (any action must be strictly and unambiguously defined in each case); 3. Finiteness (each action and the algorithm as a whole must be able to complete); 4. Mass character (the same algorithm can be used with different initial data); 5. Efficiency (no errors, the algorithm must lead to the correct result for all valid input values). Types of algorithms: 1. Linear algorithm (description of actions that are performed once in a given order); 2. Cyclic algorithm (description of actions that must be repeated a specified number of times or until the task is completed); 3. Branching algorithm (an algorithm in which, depending on the condition, either one or another sequence of actions is performed) 4. Auxiliary algorithm (an algorithm that can be used in other algorithms by specifying only its name). For a more visual representation of the algorithm, a graphical form is widely used - a block diagram, which is composed of standard graphic objects.

View of a standard graphic object

Purpose

Beginning of the algorithm

End of algorithm

The action performed is written inside the rectangle

The condition for performing actions is written inside the rhombus

Repeat counter
Sequence of actions
An example of writing an algorithm in the form of a flowchart:

Stages of creating an algorithm: 1. The algorithm must be presented in a form understandable to the person who develops it. 2. The algorithm must be presented in a form understandable to the object (including the person) that will perform the actions described in the algorithm. The object that will execute the algorithm is usually called the executor. Executor- the object that executes the algorithm. Ideal performers are machines, robots, computers... A computer– automatic executor of algorithms. An algorithm written in a computer-friendly programming language is called program. Questions:

    What is an algorithm? Give examples of algorithms. What properties of algorithms do you know? What kinds of algorithms do you know? What ways of writing algorithms do you know? What is an algorithm executor? What is a program?
III. Practical part. In the next few lessons, we will write algorithms in the QBasic programming language. In this lesson, we will learn how to enter and run a QBasic program. Modern programming environments allow programmers to create complex and convenient programs, they have many additional features, but they are not suitable for initial acquaintance with a programming language, because we'll have to first learn their interface for a long time before we can write a simple program. Therefore, we will begin our acquaintance with the QBasic language using the good old shell. Let our programs at this stage not look very attractive, but having mastered the skills of programming, you will then be able to create useful and beautiful programs. To start the QBasic programming environment, give the command Start→All Programs→Programming→QBasic. After launch, a shell will appear on the screen:

Press the key to get started. To switch from windowed mode to full screen mode, press the + keys. To switch the input language in this shell, use the key combination + on the left to the Latin layout, on the right - to Russian. The top line of the window that opens contains a menu of possible operating modes, the bottom line contains a brief reference on the purpose of the main function keys. The rest of the screen belongs to the text editor window and is intended for entering and editing program text. When we enter a program, we work with a text editor built into the QBasic environment. To create program text, you need to enter this text using the keyboard. The cursor position is indicated by a flashing yellow underscore. Each command is entered on a separate line. To move to the next line, press the key. If the text of the program does not interfere on the screen, you can use the scroll bars. Well, you already know how to work in a text editor. Now let's try to write and run a simple program. Write in the program window the text: PRINT "Hi!" The Print command prints text to the screen. The register of the command entry does not matter. Run the program with the Run→Run command or by pressing the key. On the screen you will see the result of your program.

To return to program edit mode, press any key. Try running the program again. Every time you start up, the screen will display the phrase "Hello!". In order for the screen to be cleared when you start your program, add the CLS command. CLS PRINT "Hi!" Now, after starting, the CLS command is first executed - clear the screen, and then the PRINT command - output to the screen. Let's see what happens if you make a mistake when writing a command. Make a mistake in the PRINT command (for example, PRNT). When you run the program, you will receive an error message. The erroneous command will be highlighted.

Click OK and correct the error. The File menu is used to save and open programs. When saving a file, rule 8.3 applies to the file name. Try saving your program to a file. Now let's create a program that calculates the area of ​​a rectangle. CLS a=3 b=4 s=a*b END A, B, C are variables for storing the length, width and area of ​​the rectangle. The variable name can only contain Latin letters and numbers (and start with a letter). We will learn more about variables in the next lesson. END is an end-of-program statement. Variables can be entered from the keyboard. Fix the program like this: CLS INPUT "Enter length: ", a INPUT "Enter width: ", b s=a*b PRINT "The area of ​​the rectangle is ",s END Run the program. Now the values ​​of the variables a and b must be entered from the keyboard. Enter a length value and press , then enter a width value and press again. (i.e. the entry of a value is ended by pressing the key). In the remaining time, try to "play around" with the program yourself. Students complete the task. IV. D / s Know what an algorithm is, properties and types of algorithms, ways to write algorithms. Additional task: find information about the most common programming languages. V. Students' questions. Answers to students' questions. VI. Summary of the lesson. Summing up the lesson. Grading. In the lesson, we got acquainted with what an algorithm is, what properties it has and how it can be written. We started to get acquainted with programming in the QBasic language and wrote our own small program. Lesson number 27. Topic: "Introduction to the Basic programming language". Lesson Objectives:

    to help students learn the concept of an algorithm, a program, to teach how to create simple programs in the Basic programming language. education of information culture of students, attentiveness, accuracy, discipline, perseverance. development of algorithmic thinking, cognitive interests, computer skills.
  • Lesson in informatics and geography in the 10th grade Topic: Global problems of mankind

    Lesson

    The purpose of the lesson: to reveal the content of the concept of "global problems of mankind"; to form knowledge about the global problems of mankind; analyze the causes, essence, features of each of the problems, possible ways to solve them.

  • Abstract

    To promote the use of these concepts in solving problems for which the answer is not a number or a statement, but a description of a sequence of actions;

  • Department of Informatics AbstractMaster class Topic: “Methods of preparing and conducting an open lesson on informatics and ict. Non-standard ways to design a presentation in ms powerpoint "

    Lesson

    The lesson is held in grades 8-11, depending on the school thematic plan, and can also be held in grades 5-7 as part of additional classes as needed, but strictly after studying the MS Word program, i.e.

  • Currently, there are three types of machine translation systems:

    Systems based on grammatical rules (Rule-Based Machine Translation, RBMT);

    Statistical systems (Statistical Machine Translation, SMT);

    hybrid systems;

    Grammar-based systems analyze the text that is used in the translation process. Translation is made on the basis of built-in dictionaries for a given language pair, as well as grammars covering the semantic, morphological, syntactic patterns of both languages. Based on all this data, the source text is sequentially, sentence by sentence, converted into text in the required language. The basic principle of operation of such systems is the connection between the structures of the source and final texts.

    Systems based on grammatical rules are often divided into three more subgroups - word-for-word translation systems, transfer systems and interlinguistic systems.

    The advantages of systems based on grammatical rules are grammatical and syntactic accuracy, stability of the result, and the ability to adjust to a specific subject area. The disadvantages of systems based on grammar rules include the need to create, maintain and update linguistic databases, the complexity of creating such a system, as well as its high cost.

    Statistical systems in their work use statistical analysis. A bilingual corpus of texts is loaded into the system (containing a large amount of text in the source language and its “manual” translation into the required language), after which the system analyzes the statistics of interlingual correspondences, syntactic constructions, etc. The system is self-learning - when choosing a translation option, it relies on previous statistics. The larger the vocabulary within a language pair and the more accurate it is, the better the result of statistical machine translation. With each new translated text, the quality of subsequent translations improves.

    Statistical systems are quick to set up and easy to add new translation directions. Among the shortcomings, the most significant are the presence of numerous grammatical errors and the instability of translation.

    Hybrid systems combine the approaches described earlier. It is expected that hybrid machine translation systems will combine all the advantages that statistical systems and rule-based systems have.

    1.3 Classification of machine translation systems

    Machine translation systems are programs that perform fully automated translation. The main criterion of the program is the quality of the translation. In addition, important points for the user are the convenience of the interface, the ease of integrating the program with other document processing tools, the choice of topics, and the dictionary replenishment utility. With the advent of the Internet, major machine translation vendors have incorporated Web interfaces into their products, while also integrating them with other software and email, allowing MT to be used to translate Web pages, email, and online conversations.

    New members of CompuServe's foreign language forum often ask if anyone can recommend a good machine translation program for a reasonable price.

    The answer to this question is invariably "no". Depending on the respondent, the answer may contain two main arguments: either that machines cannot translate, or that machine translation is too expensive.

    Both of these arguments are valid to a certain extent. However, the answer is far from being so simple. When studying the problem of machine translation (MT), it is necessary to consider separately the various subsections of this problem. The following division is based on lectures by Larry Childs given at the 1990 International Conference on Technical Communication:

    Fully automatic translation;

    Automated machine translation with human participation;

    Translation performed by a person using a computer.

    Fully automated machine translation. This kind of machine translation is what most people mean when they talk about machine translation. The meaning here is simple: text in one language is entered into the computer, this text is processed and the computer outputs the same text in another language. Unfortunately, the implementation of this type of automatic translation faces certain obstacles that have yet to be overcome.

    The main problem is the complexity of the language itself. Take, for example, the meanings of the word "can". In addition to the main meaning of the modal auxiliary verb, the word "can" has several official and slang meanings as a noun: "bank", "latrine", "prison". In addition, there is an archaic meaning of this word - "to know or understand." Assuming that the target language has a separate word for each of these values, how can a computer tell them apart?

    As it turns out, some progress has been made in developing translation programs that discriminate meaning based on context. More recent studies in the analysis of texts rely more on probability theory. However, fully automated machine translation of texts with extensive subject matter is still an impossible task.

    Automated machine translation with human participation. This kind of machine translation is now quite feasible. Speaking about machine translation with the participation of a person, they usually mean editing texts both before and after they are processed by a computer. Human translators change texts so that they can be understood by machines. After the computer has done the translation, people again edit the rough machine translation, making the text in the target language correct. In addition to this order of work, there are MT systems that require the constant presence of a human translator during translation to help the computer translate especially complex or ambiguous structures.

    Human-assisted machine translation is applicable to a greater extent to texts with a limited vocabulary of narrowly limited topics.

    The economics of using human-assisted machine translation is still debatable. The programs themselves are usually quite expensive, and some of them require special equipment to work. Pre and post editing needs to be learned, and it's not a pleasant job. Creating and maintaining databases of words is a laborious process and often requires special skills. However, for an organization translating large volumes of texts in a well-defined subject area, human-assisted machine translation can be a fairly cost-effective alternative to traditional human translation.

    Translation performed by a person using a computer. With this approach, the human translator is placed at the center of the translation process, while the computer program is regarded as a tool that makes the translation process more efficient and the translation accurate. These are ordinary electronic dictionaries that provide a translation of the required word, making the person responsible for choosing the right option and the meaning of the translated text. Such dictionaries greatly facilitate the process of translation, but require the user to have a certain knowledge of the language and spend time on its implementation. And yet the translation process itself is greatly accelerated and facilitated.

    Among the systems that help the translator in his work, the most important place is occupied by the so-called Translation Memory (TM) systems. TM systems are an interactive tool for accumulating pairs of equivalent text segments in the original language and translation in a database with the possibility of their subsequent search and editing. These software products are not intended to use highly intelligent information technologies, but, on the contrary, are based on the use of the translator's creative potential. In the course of work, the translator himself forms the database (or receives it from other translators or from the customer), and the more units it contains, the greater the return on its use.

    Here is a list of the most famous TM systems:

    Transit Swiss company Star,

    Trados (USA),

    Translation Manager from IBM,

    Eurolang Optimizer of the French company LANT,

    DejaVu from ATRIL (USA),

    WordFisher (Hungary).

    TM systems make it possible to exclude the repeated translation of identical text fragments. The translation of a segment is carried out by the translator only once, and then each subsequent segment is checked for a match (full or fuzzy) with the database, and if an identical or similar segment is found, then it is offered as a translation option.

    Currently, developments are underway to improve the TM systems. For example, the core of Star's Transit system is based on neural network technology.

    Despite the wide range of TM systems, they share a few common features:

    Alignment function. One of the advantages of TM systems is the ability to use already translated materials on this topic. The TM database can be obtained by segment-by-segment comparison of the original and translation files.

    Availability of import-export filters. This property ensures the compatibility of TM systems with a variety of word processors and publishing systems and gives the translator relative independence from the customer.

    Mechanism for searching for fuzzy or complete matches. It is this mechanism that represents the main advantage of TM systems. If, when translating a text, the system encounters a segment that is identical or close to the previously translated one, then the already translated segment is offered to the translator as a variant of the translation of the current segment, which can be corrected. The degree of fuzzy matching is set by the user.

    Support for thematic dictionaries. This feature helps the translator to stick to the glossary. As a rule, if a translated segment contains a word or phrase from a thematic dictionary, it is highlighted in color and its translation is offered, which can be inserted into the translated text automatically.

    Means of searching for fragments of text. This tool is very handy when editing a translation. If in the process of work a more successful version of the translation of any fragment of the text was found, then this fragment can be found in all TM segments, after which the necessary changes are sequentially made to the TM segments.

    Of course, like any software product, TM systems have their advantages and disadvantages, and their scope. However, with regard to TM systems, the main disadvantage is their high cost.

    It is especially convenient to use TM systems when translating documents such as user manuals, operating instructions, design and business documentation, product catalogs and other documents of the same type with a large number of matches.

    Topic: “Computer translators. Text recognition systems.

    Lesson Objectives:

      to help students gain an understanding of computer dictionaries and machine translation systems, to get acquainted with the capabilities of these programs, to teach how to use these programs. help students gain an understanding of OCR - text recognition programs, get acquainted with the capabilities of the program's data, teach to recognize scanned text, transfer and edit it in Word. education of information culture of students, attentiveness, accuracy, discipline, perseverance. development of cognitive interests, computer skills, self-control, ability to take notes.

    Equipment:
    board, computer, computer presentation.

    Lesson plan:

    1) Organizational moment. (1 min)

    2) Actualization of knowledge. (5 minutes)

    3) Theoretical part. (10 minutes)

    4) Practical part. (15 minutes)

    5) Homework (2 min)

    6) Students' questions. (5 minutes)

    7) The result of the lesson. (2 minutes)

    During the classes:

    I. Organizational moment.

    Greeting, checking those present. Explanation of the lesson.

    II. Knowledge update.

    As you can see, in order to get an electronic, ready-to-edit copy of any printed text, the OCR program needs to perform a “chain” of many separate operations.

    First you need to recognize the structure of the text on the page: select columns, tables, images, and so on. Next, the selected text fragments of the graphic image of the page must be converted to text.

    If the source document has typographical quality (a fairly large font, no poorly printed characters or corrections), then the recognition problem is solved by comparing with a raster template. First, the bitmap of the page is split into individual character images. Then each of them is successively superimposed on the symbol templates available in the system memory, and the template with the least number of points different from the input image is selected.

    When recognizing documents with low print quality (typewritten text, fax, and so on), the method of character recognition is used by the presence of certain structural elements in them (segments, rings, arcs, etc.).

    Any symbol can be described through a set of parameter values ​​that determine the relative position of its elements. For example, the letter "H" and the letter "I" consist of three segments, two of which are parallel to each other, and the third connects these segments. The difference between these letters is in the size of the angles that the third segment forms with the other two.

    When recognizing by the structural method, characteristic details are distinguished in a distorted symbolic image and compared with the structural patterns of symbols. As a result, the symbol is selected for which the totality of all structural elements and their arrangement most of all corresponds to the recognized symbol.

    The most common optical character recognition systems, such as ABBYY FineReader and CuneiForm from Cognitive, use both raster and structured recognition methods. In addition, these systems are "self-learning" (for each specific document, they create an appropriate set of character templates) and therefore the speed and quality of recognition of a multi-page document is gradually increasing.

    You can purchase text recognition software separately or get it for free along with the scanner you bought.

    Perhaps the most famous OCR software is ABBYY's FineReader. It is this program that is most often remembered when it comes to recognition systems.

    FineReader allows you to recognize texts typed in almost any font, without prior training. A feature of the FineReader program is high recognition accuracy and low sensitivity to print defects, which is achieved through the use of "holistic purposeful adaptive recognition" technology.

    FineReader has a ton of extra features that the average user may not need, but will impress certain groups of buyers. So, one of FineReader's trump cards is support for an incredible number of recognition languages ​​- 176, among which you will find exotic and ancient languages, and even popular programming languages.

    But not all features are included in the simplest modification of the program, which you can get for free along with the scanner. Batch scanning, competent processing of tables and images - for all this it is worth purchasing a professional version of the program.

    All versions of FineReader, from the simplest to the most powerful, share a user-friendly interface. To start the recognition process, you just need to put the document in the scanner and press the only button (Scan & Read wizard) on the program toolbar. All further operations - scanning, splitting the image into "blocks" and, finally, the actual recognition of the program will perform automatically. The user will only need to set the desired scanning parameters.

    The quality of recognition largely depends on how good the image is obtained during scanning. Image quality is controlled by setting the basic scan parameters: image type, resolution, and brightness.

    Scanning in gray is the optimal mode for the recognition system. In the case of scanning in gray mode, automatic selection of brightness is performed. If you want the color elements contained in the document (pictures, lettering and background colors) to be transferred to an electronic document with color preservation, you must select a color image type. Otherwise, use the gray image type.

    The optimal resolution for ordinary texts is 300 dpi and 400-600 dpi for texts in small print (9 or less points).

    Once page recognition is complete, FineReader will offer the user a choice: scan and recognize further (for a multi-page document) or save the resulting text in one of many popular formats - from Microsoft Office documents to HTML or PDF. You can, however, immediately transfer the document to Word or Excel, and already there correct all the recognition flaws (it is simply impossible to do without it). At the same time, FineReader fully preserves all the features of document formatting and its graphic design.

      Why do we need text recognition programs? How is text recognition done? What text recognition programs do you know? What did they use? What resolution is optimal for scanning text, images?

    III. Practical part.
    1. Working with a text translator (by cards)
    2. Now let's practice working with ABBYY FineReader. We will use a simplified version of the program that comes with the scanner.

    IV. Homework
    Know what automatic text translation programs are, be able to work with these programs. Additional task: connect to the Internet and use any on-line translator to translate the text.
    Know what text recognition programs are, be able to work with these programs. Additional task: install the OCR program at home and prepare an essay on any subject. Recognize text in OCR, edit and format in Word.

    V. Students' questions.
    Answers to students' questions.

    VI. Summary of the lesson.
    Summing up the lesson. Grading.

    At the lesson, we got acquainted with computer translation programs for texts, learned how to translate words and text using a translator program.

    At the lesson, we got acquainted with OCR programs, learned how to recognize a scanned image using ABBYY FineReader 5.0.

    Knowledge of foreign languages ​​is not only a useful skill in everyday life, but also one of the main requirements when applying for a job. At present, the need for knowledge of one or even several foreign languages ​​is becoming increasingly relevant.

    The knowledge of the language (English or German) is necessary for the secretary not only when traveling on vacation abroad, but also when receiving business partners from abroad, in everyday life when reading news or watching movies. Therefore, a large number of routine routine and everyday operations that did not previously require knowledge of a foreign language, today, due to the development of international integration processes and the widespread desire of business for globalization, are becoming increasingly difficult if you rely on only one language.

    Today, however, knowledge of foreign languages ​​alone is not enough, since the volume of information that needs to be translated daily has increased significantly. At the same time, this task is successfully solved, and it is not difficult for anyone to translate a contract or the content of a foreign site in just a few seconds. And all because in this case the translation program is engaged in translation: a person does not have time to blink an eye, and the translation is already ready.

    Target This lesson is to help students get an idea about computer dictionaries and machine translation systems, to introduce the capabilities of these programs, to teach how to use these programs.

    Download:


    Preview:

    Lesson topic: "Systems of machine translation of texts and computer dictionaries"

    Lesson Objectives:

    • to help students get an idea about computer dictionaries and machine translation systems, to acquaint them with the capabilities of these programs, to teach how to use these programs.
    • development of cognitive interests, self-control, ability to take notes.
    • education of information culture of students, attentiveness, accuracy, discipline, perseverance.

    Lesson type: lesson learning new material

    Equipment:

    computer, projector, screen, computer presentation.

    During the classes:

    I. Org. moment

    Greeting, checking those present.

    II. Knowledge update

    Knowledge of at least one foreign language is necessary today for everyone, like air. Especially for users: after all, it is impossible to avoid a collision with the English language when working on a computer. Well, if it only concerned files with documentation for programs (which traditionally no one reads in our country) or the names of program menus (which you can learn to navigate without knowing the language, by scientific poke and cramming).

    It's bad if you don't know a foreign language... However, I'm sorry - in part - you can help by installing one of the specialized translation programs on your computer or using computer dictionaries.

    Write down the topic of the lesson “Machine translation systems for texts
    and computer dictionaries.

    Today in the lesson you will get an idea about computer dictionaries and automatic text translation systems and get acquainted with the capabilities of these programs.

    III. Introduction to new material

    With the advent of writing, people received a powerful tool for storing knowledge and for communication. The first writings that have come down to us on the walls of temples and tombs tell about the deeds of kings and generals that took place many centuries ago. In addition, people recorded the results of economic activities in order to successfully trade, collect taxes, etc.

    To facilitate written communication between peoples, the first dictionaries were created. One of these dictionaries was written by Sumerian priests on clay tablets. Each tablet was divided into two equal parts. On the one hand, a Sumerian word was written, and on the other, a word of a similar meaning in another language, sometimes with a brief explanation. From that time to the present day, the structure of dictionaries has not changed much.

    Currently, there are thousands of dictionaries for translating between hundreds of languages ​​(English-Russian, German-French, and so on), and each of them can contain tens of thousands of words.

    In the paper version, the dictionary is a thick book with hundreds of pages, where searching for the right word is a rather laborious process.

    The current globalization of our world leads to the need to exchange documents between people and organizations located in different countries of the world and speaking different languages.

    Under these conditions, the use of traditional manual translation technology hinders the development of interethnic contacts. Manual translation of multi-page documentation requires a long time and high wages for translators. Translation of an e-mail received or a Web page viewed in a browser must be done immediately, and there is no opportunity and time to invite a translator.

    Machine translation systems solve these problems.

    With the advent of a personal computer, electronic dictionaries began to be created, which made it easier to find the right word and offered many new useful functions (voicing a word, searching for synonyms, etc.).

    Machine translation technology has been gradually improved. And if the quality and speed of translation of the first systems left much to be desired, now a computer can really translate text from one language into another in a coherent manner. And more modern systems with acceptable quality translate 1 page of text in 1 second.

    Computer dictionaries are multilingual.

    They allow the user to select languages ​​and the direction of translation (for example, English-Russian, Spanish-Russian, etc.)

    In addition to the main dictionary of commonly used words, they can contain dozens of specialized dictionaries in areas of knowledge (technology, medicine, etc.)

    Provide a quick search for dictionary entries.

    Allows you to enter phrases and texts.

    May be multimedia, i.e. provide the user with the opportunity to listen to words performed by speakers, native speakers.

    Basic requirements for computer translators

    Efficiency consists in the possibility of constant updating of vocabulary and thematic sections.

    Flexibility designed for a specific subject area.

    Speed - the possibility of auto-input and processing of text information from paper. One such system (OCR-System) replaces more than ten experienced typists every day.

    Accuracy is literacy and adequate transfer of the meaning of the translated text into the target language.

    Computer translation systems have their pros and cons.

    pros

    High translation speed. Computer translation systems allow you to translate multi-page documents at high speed.

    Versatility.A professional translator, as a rule, has a specialization in translating texts on a certain subject. Translation programs allow you to translate technical documentation, business correspondence and other specialized texts with sufficient quality.

    Low cost of translation. Using the services of professional translators, we are forced to pay money for each page of the translation. However, often there is no need to get a perfect translation of the text, but you need to catch the meaning of the letter sent or the content of the page on the Internet. In this case, the translation system will undoubtedly become a reliable and effective assistant.

    Confidentiality. Many users regularly use machine translation systems to translate personal letters, because not every person is ready to give an outside translator personal correspondence or entrust the translation of commercial or financial documents.

    On-line translation and translation of web content. Allows you to translate web pages in real time. Online translation services are always at hand and will help you quickly translate information at the right time. In addition, today, with the help of translation systems, you can translate the content of Internet pages and search engine queries.

    Minuses

    But these systems cannot be completely relied upon. They aremake semantic and stylistic errors and not applicable to translation of works of art, as they are not able to adequately translate metaphors, allegories and other elements of human artistic creativity.

    This happens because they translate texts based on formal “knowledge”: the syntax of the language (the rules for constructing sentences), the rules of word formation and the use of dictionaries.

    The use of machine translation without topic customization is the subject of many Internet jokes.

    The phrase "Our cat gave birth to three kittens - two whites and one black", which the PROMT online translator turned into "Our cat gave birth to three kittens - two whites and one African American." If “African American” could still be made “black” by writing “black kitten”, then “cat” could not change gender: for example, female cat was translated as “female cat”.

    The program does not recognize the context of the phrase and translates the terms verbatim, besides not distinguishing proper names from ordinary words. The same PROMT translator turns "Leo Tolstoy" into "Lion Thick" ("fat lion").

    Google Translator, on the contrary, often took the word "rice" (rice) for the surname of the former US Secretary of State.

    What to do to avoid such mistakes?

    • Before you start translating, you needdefine text type, that is, from which area of ​​​​human life it is represented (economics, sports, science, etc.). After all, each area has its own nuances and terms.
    • Mistranslations are often caused by typos translated text. This also applies to recognized texts. Words with errors are marked by translators as unfamiliar, because they are not in dictionaries in this form. Worse if there iserrors in punctuation- one incorrectly placed comma can distort the translation of the entire sentence.
    • Work with text snippets. Never translate the entire text at once. It will always contain words that are not in the dictionary and those that the system does not translate correctly.

    Online computer translators on the Internet provide a choice of thematic dictionary and the direction of translation. They allow you to translate any text typed in the translation window or copied from the clipboard, web pages, including hyperlinks, as well as emails.

    The most popular are Google,PROMT, Yandex.translation.

    Translate.ru is the first Russian web service designed to translate text or web pages into other languages. When translating a single word, it produces a dictionary entry. Opened on March 6, 1998 by PROMT.

    Yandex.Translation is a Yandex web service designed to translate part of a text or web page into another language. The system builds its correspondence dictionaries based on the analysis of millions of translated texts. The text for translation is first compared by the computer with the database of words, then with the database of language models, trying to determine the meaning of the expression in the context.

    Google Translate is a Google web service designed to automatically translate part of a text or web page into another language. For some languages, translation options are offered to users, for example, for technical terms that should be included in translation system updates in the future. The service also includes the translation of the entire web page and even the simultaneous search for information with translation into another language.

    Using a machine translation system can significantly reduce the time required to translate texts. The best among Russian machine translation systems arePROMT, ABBYY Lingvo, Multitran, SlovoEd.

    Electronic dictionary ABBYY Lingvo. The multilingual version covers 15 languages. ABBYY Lingvo does not have a full-text translation function, but word-by-word translation of texts from the clipboard is possible.

    Electronic dictionary Promt. PROMT offers translation programs and PROMT dictionaries for translating texts from English, German, French, Spanish and other languages ​​into Russian and vice versa. A complete solution for automatic translation of business and technical documents, e-mail, websites and presentations.

    Translators are available not only for personal computers, but also for mobile phones, tablets and other devices.

    For example, in order to quickly and conveniently translate the necessary paragraphs of texts, a program for Android - Quick Translator was created. Its main advantage is that it translates texts after copying them to the clipboard. All you have to do is highlight the parts of the text you need to get a quick translation right on your Android tablet or phone.

    IV. Consolidation of the studied material

    We met in classcomputer dictionaries and automatic text translation systems and the capabilities of these programs.

    1. What are the requirements for computer translators?

    2. What are the advantages of computer dictionaries over traditional paper dictionaries?

    3. When is it advisable to use computer translation systems?

    4. What texts should not be translated using computer translators?

    5. What are the names of the best Russian machine translation systems?

    V. Homework

    Prepare for practical work

    VI. Summary of the lesson.

    Summing up the lesson. Grading.

    LIST OF USED LITERATURE

    Mikheeva E.V. Information technologies in professional activity. – M.: Academy, 2006. – 384 p.

    Sapkov V.V. Information technologies and computerization of office work. – M.: Academy, 2006. – 288 p.

    Types of machine translation systems //logrus.ru

    Documents drawn up in foreign languages, the travel agency should be translated into Russian. Otherwise, the expenses on them will not be accepted for tax purposes. However, sometimes translation is not needed. For example, if we are talking about the details of an electronic ticket encoded in Latin (letter of the Federal Tax Service of Russia dated June 7, 2011 No. ED-4-3 / 8983).

    The need for translation of documents
    In accordance with paragraph 1 of Article 16 of the Law of the Russian Federation of October 25, 1991 No. 18071 “On the languages ​​of the peoples of the Russian Federation”, official office work in organizations in our country is conducted in Russian.

    And as enshrined in paragraph 9 of the Regulation on accounting and financial reporting in the Russian Federation, approved by order of the Ministry of Finance of Russia dated July 29, 1998 No. 34n, accounting of property, liabilities and business transactions (facts of economic activity) is recorded in the currency of the Russian Federation - in rubles .

    At the same time, documentation of property, liabilities and other facts of economic activity, maintenance of accounting and reporting registers are carried out in Russian. Further it is said: primary accounting documents drawn up in other languages ​​must have a line-by-line translation into Russian.

    Based on these norms, the regulatory authorities conclude that primary documents, if they are drawn up in a foreign language, must be translated into Russian. This opinion is expressed, in particular, in the letters of the Ministry of Finance of Russia dated November 3, 2009 No. 03-03-06 / 725, dated September 14, 2009 No. 03-03-05 / 170, dated February 16, 2009 No. 03- 03-05/23.

    Although organizations manage to defend expenses in courts, which are confirmed by documents without translation (decrees of the Federal Antimonopoly Service of the Moscow District dated April 21, 2011 No. KA-A40 / 2152-11, dated October 8, 2008 No. KA-A40 / 8061-08).

    Also, arbitrators most often take the side of taxpayers, pointing out that the lack of translation of documents into Russian cannot serve as a basis for refusing to deduct VAT. An example of this is the resolution of the Federal Antimonopoly Service of the Moscow District of April 1, 2009 No. KA-A40 / 132809, of March 16, 2009 No. KA-A40 / 1450-09, the Federal Antimonopoly Service of the West Siberian District of March 5, 2007 No. F04-979 /2007(31967-A45-14).

    However, if the documents of the travel agency do not have a translation, it will most likely be necessary to defend the possibility of tax accounting for expenses or the right to a deduction in court. In this case, the outcome of the litigation may not be in favor of the taxpayer.

    How to translate a document
    The financial department clarifies that the translation can be done by both a professional translator and the organization itself, or rather, its employee (letters dated September 14, 2009 No. 03-03-05 / 170, dated March 20, 2006 No. 03-02-07 /1-66).

    At the same time, the legislation does not establish how such a document should be drawn up. Therefore, it can be executed as a separate document or inscribed in Russian text on a photocopy of a foreign primary document.

    It should be noted that the translation can also be performed by the organization that issued the primary document, for example, in the form of a certificate (letter of the Ministry of Finance of Russia No. 03-03-05 / 170).

    When can you do without translation?
    In some cases, documents will not need to be translated.

    This does not need to be done, firstly, if you regularly receive standard documents from your foreign counterparties, in which only numerical indicators differ (number, date of the document, price, etc.), it is enough to translate the form of the document into Russian once. Explanations on this issue are given in the letter of the Ministry of Finance of Russia dated November 3, 2009 No. 03-03-06/1/725.

    Secondly, it is not required to translate information that is not essential for confirming the expenses incurred.

    For example, the conditions for applying the fare, the rules of air transportation, the rules for the carriage of baggage. Officials of the main financial department drew attention to this in a letter dated September 14, 2009 No. 03-03-05 / 170.

    Thirdly, there is no need to translate the formalized (encoded) details of an electronic ticket filled in using Latin characters (letters of the Federal Tax Service of Russia dated June 7, 2011 No. ED-4-3 / 8983, dated April 26, 2010 No. ШС-37- 3/ [email protected]).

    But when the values ​​in the electronic ticket are actually indicated in a foreign language and do not coincide with the formalized (encoded) values ​​or codes in accordance with the Unified International Codifiers, then these indicators (values) of the ticket are subject to translation into Russian.

    Accounting for translation costs
    In accounting, the expenses of a travel company for payment for translation services are included in other expenses in the month in which they are provided. This is reflected in the debit entry of account 91 "Other income and expenses" (sub-account "Other expenses") and the credit of account 76 "Settlements with various debtors and creditors" (clauses 11, 16, 18 PBU 10/99 "Expenses of the organization").

    Such expenses are also accepted for profit taxation purposes - as part of others as payment for information services (subclause 14, clause 1, article 264 of the Tax Code of the Russian Federation) or as other other expenses. About this - in the letter of the Federal Tax Service of Russia for Moscow dated May 26, 2008 No. 20-12 / 050126. True, this rule applies to the cost of a third-party translation.

    Recall that in order to comply with the requirements of Article 252 of the Tax Code of the Russian Federation, the costs of translating documents must be documented.

    With the "simplification" to take into account such costs will not work. Since they are not provided for in the closed list of permitted expenses (clause 1 of article 346.16 of the Tax Code of the Russian Federation).

    Important to remember

    The costs of third-party translation of documents can be deducted for income tax purposes. But it is impossible to accept them with a “simplification”.