User Tools

Site Tools


picto:picto2_ocr_labelling_tool

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
picto:picto2_ocr_labelling_tool [2020/07/17 10:38]
karma
picto:picto2_ocr_labelling_tool [2021/06/11 07:39] (current)
karma [At the end]
Line 1: Line 1:
-====== ​Picto2_OCR Data Labeling ​Handwriting ​======+====== ​OCR Labeling ​- Transcription Project ​======
  
-==== Tool and Test files ====+==== Tool and files ====
  
 Thank you for your interest in working with us on this project. \\ Thank you for your interest in working with us on this project. \\
Line 7: Line 7:
 Please follow the next steps to access the OCR tool and do the test. \\ Please follow the next steps to access the OCR tool and do the test. \\
  
-  * Create a folder ​that says ** OCR Data Labeling Handwriting ** +  * Create a folder 
-  * Download from our server ​[[https://​nc.oho2.com/​s/​fNRmZfo27nH898C|the tool]] in that folder ​and the test+  * Download ​the tool from our server ​(the Project will give you the toolin that folder: 
-       ​[[https://​nc.oho2.com/​s/​wQtfEdtMCsjFSXk|German language]] +  Extract the files with winRAR or winZIP\\ 
-       ​[[https://nc.oho2.com/​s/​bL2k6AwcZY9tAWEm|Italian ​language]] +  Download the files from our server (//your language// ​**Images to do**) in that folder ​and extract them (Attention! Your project manager ​will tell the file names in our working chat oho2.com or per email)
-       * [[https://nc.oho2.com/​s/​qgpBW6ijXCDDzXz|Spanish language]] +
-       [[https://​nc.oho2.com/​s/​7YroysKgqKc5qk7|French language]] +
-       [[https://​nc.oho2.com/​s/​kyYLWBrXprb8xDB|Portuguese language]] +
-       [[https://​nc.oho2.com/​s/​jXRYaQCwbcdzm2J|Japanese language]] +
-       [[https://​nc.oho2.com/​s/​nZzXpPDKB4gQwwR|Korean language]] +
-  * Extract the files +
-  * In the test folder ​you will find several subfolders that contain ​the images to work with\\+
   ​   ​
 ==== Open the tool ==== ==== Open the tool ====
  
-Open the extracted folder **Latin_LiteOcrLabelingTool** \\ +Open the extracted folder ​with the **Labeling Tool** \\ 
-Doubleclick ​on file **Latin_LiteOcrLabelingTool.exe** \\ +Double click on .exe file \\ 
-{{: picto: ​selec.png |}}+{{ :picto:tool-open1.jpg?​nolink ​|}}
 You'll see the following\\ You'll see the following\\
- +{{ :picto:tool222.jpg?​nolink ​|}}
-{{: picto: ​presio.png |}}+
   * Click on menu **File** > **Labeling settings**   * Click on menu **File** > **Labeling settings**
   * Choose the Images and Results Folders. They should be the same   * Choose the Images and Results Folders. They should be the same
Line 33: Line 25:
   * Select the same folder for results   * Select the same folder for results
 {{ :​picto:​ocr_labeling_tool_browse.jpg?​nolink |}} {{ :​picto:​ocr_labeling_tool_browse.jpg?​nolink |}}
 +  * Click on File > **Open** > OK and the tool opens the chosen folder
 +  * The first file is opened
 +{{ :​picto:​picto2-111.png?​nolink |}}
 +
 +
 +==== How to transcribe ====
 +  * Double click on the box and an empty segment appears, transcribe the text exactly as it is
 +{{ :​picto:​transcription1.jpg?​nolink |}}
 +
 +  * **Please transcribe all text in the boxes and follow the [[https://​nm.oho2.com/​index.php/​f/​21314|guides]].**\\
 +  ​
 +  * Files "T3 Characters set.xlsx"​ and "T3 Symbol Set_EN.xlsx"​ are helpful to know what characters and symbols you may use while work.
 +
 +  * When you finish all text lines, mark the image "As labeled"​. It will be marked blue.
 +{{ :​picto:​picto2-333.jpg?​nolink |}}
 +
 +  * When you finish, the tool will update xlm-files in the working folder automatically.\\
 +
 +  * Please upload the whole folder in one archive to our server (//Project name// - Images 2deliver - your language) and let your project manager know. Alternatively you may use WeTransfer to deliver the files.
 +
 +==== Trouble shooting ===
 +
 +  * If the tool doesn'​t open, please change language to US English in your system like below:\\
 +{{ :​picto:​windows-10-1809-data-formats.jpg?​nolink |}}
 +
 +  * If the picture is too small, hold "​Ctrl"​ and scroll the mouse wheel to zoom the image. Double click on the box and tipp the text, then click "​Enter"​
 +{{ :​picto:​picto2-222.jpg?​nolink |}}
 +
 +  * If you need to add a special symbol you may use Alt Codes - https://​www.alt-codes.net/​
 +
 +==== At the end ===
 +
 +  * Your project manager will give you login ang password to our cloud server.
 +
 +  * Go to folder (//your language// - **Ready images**) and upload all ready images there.
 +
 +  * To estimate your work please use this [[https://​nm.oho2.com/​index.php/​f/​21320|tool]]. About once a week you should send the logs to me, and I will create a Purchase Order. Then you may send your invoice to us.
  
-will take you to find the first folder of the ES_test \\ files 
-Select the first folder which is Document \\ 
-Then click on the second Browse \\ 
-Select the same folder that is Document \\ 
-Press OK \\ 
-Go to File \\ 
-Open \\ 
-will take you to this window \\ 
-\\ 
  
-Find this subfolder that says Document and press OK \\ 
-The image opens \\ 
-You start to do the exam, that is to say to frame all the lines that have text and follow the \\ 
-indications of the guides. For this exam guide yourself with the guide \\ 
-Handwriting_Tier1_DevTestLabeling_Criteria_v1.0 without ignoring some indications of the other \\ 
-guide, so it is important to read both. \\ 
-\\ 
-This is how it should look once it is finished. \\ 
-\\ 
-Make sure the tags on the right are in Handwriting,​ Spanish, and by the rules other tags. \\ 
-Always deselecting ENGLISH. \\ 
picto/picto2_ocr_labelling_tool.1594982291.txt.gz · Last modified: 2020/07/17 10:38 by karma