This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
picto:picto2_ocr_labelling_tool [2020/07/17 10:38] karma |
picto:picto2_ocr_labelling_tool [2021/06/11 07:39] (current) karma [At the end] |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Picto2_OCR Data Labeling Handwriting ====== | + | ====== OCR Labeling - Transcription Project ====== |
- | ==== Tool and Test files ==== | + | ==== Tool and files ==== |
Thank you for your interest in working with us on this project. \\ | Thank you for your interest in working with us on this project. \\ | ||
Line 7: | Line 7: | ||
Please follow the next steps to access the OCR tool and do the test. \\ | Please follow the next steps to access the OCR tool and do the test. \\ | ||
- | * Create a folder that says ** OCR Data Labeling Handwriting ** | + | * Create a folder |
- | * Download from our server [[https://nc.oho2.com/s/fNRmZfo27nH898C|the tool]] in that folder and the test: | + | * Download the tool from our server (the Project will give you the tool) in that folder: |
- | * [[https://nc.oho2.com/s/wQtfEdtMCsjFSXk|German language]] | + | * Extract the files with winRAR or winZIP\\ |
- | * [[https://nc.oho2.com/s/bL2k6AwcZY9tAWEm|Italian language]] | + | * Download the files from our server (//your language// - **Images to do**) in that folder and extract them (Attention! Your project manager will tell the file names in our working chat oho2.com or per email) |
- | * [[https://nc.oho2.com/s/qgpBW6ijXCDDzXz|Spanish language]] | + | |
- | * [[https://nc.oho2.com/s/7YroysKgqKc5qk7|French language]] | + | |
- | * [[https://nc.oho2.com/s/kyYLWBrXprb8xDB|Portuguese language]] | + | |
- | * [[https://nc.oho2.com/s/jXRYaQCwbcdzm2J|Japanese language]] | + | |
- | * [[https://nc.oho2.com/s/nZzXpPDKB4gQwwR|Korean language]] | + | |
- | * Extract the files | + | |
- | * In the test folder you will find several subfolders that contain the images to work with. \\ | + | |
| | ||
==== Open the tool ==== | ==== Open the tool ==== | ||
- | Open the extracted folder **Latin_LiteOcrLabelingTool** \\ | + | Open the extracted folder with the **Labeling Tool** \\ |
- | Doubleclick on file **Latin_LiteOcrLabelingTool.exe** \\ | + | Double click on .exe file \\ |
- | {{: picto: selec.png |}} | + | {{ :picto:tool-open1.jpg?nolink |}} |
You'll see the following\\ | You'll see the following\\ | ||
- | + | {{ :picto:tool222.jpg?nolink |}} | |
- | {{: picto: presio.png |}} | + | |
* Click on menu **File** > **Labeling settings** | * Click on menu **File** > **Labeling settings** | ||
* Choose the Images and Results Folders. They should be the same | * Choose the Images and Results Folders. They should be the same | ||
Line 33: | Line 25: | ||
* Select the same folder for results | * Select the same folder for results | ||
{{ :picto:ocr_labeling_tool_browse.jpg?nolink |}} | {{ :picto:ocr_labeling_tool_browse.jpg?nolink |}} | ||
+ | * Click on File > **Open** > OK and the tool opens the chosen folder | ||
+ | * The first file is opened | ||
+ | {{ :picto:picto2-111.png?nolink |}} | ||
+ | |||
+ | |||
+ | ==== How to transcribe ==== | ||
+ | * Double click on the box and an empty segment appears, transcribe the text exactly as it is | ||
+ | {{ :picto:transcription1.jpg?nolink |}} | ||
+ | |||
+ | * **Please transcribe all text in the boxes and follow the [[https://nm.oho2.com/index.php/f/21314|guides]].**\\ | ||
+ | | ||
+ | * Files "T3 Characters set.xlsx" and "T3 Symbol Set_EN.xlsx" are helpful to know what characters and symbols you may use while work. | ||
+ | |||
+ | * When you finish all text lines, mark the image "As labeled". It will be marked blue. | ||
+ | {{ :picto:picto2-333.jpg?nolink |}} | ||
+ | |||
+ | * When you finish, the tool will update xlm-files in the working folder automatically.\\ | ||
+ | |||
+ | * Please upload the whole folder in one archive to our server (//Project name// - Images 2deliver - your language) and let your project manager know. Alternatively you may use WeTransfer to deliver the files. | ||
+ | |||
+ | ==== Trouble shooting === | ||
+ | |||
+ | * If the tool doesn't open, please change language to US English in your system like below:\\ | ||
+ | {{ :picto:windows-10-1809-data-formats.jpg?nolink |}} | ||
+ | |||
+ | * If the picture is too small, hold "Ctrl" and scroll the mouse wheel to zoom the image. Double click on the box and tipp the text, then click "Enter" | ||
+ | {{ :picto:picto2-222.jpg?nolink |}} | ||
+ | |||
+ | * If you need to add a special symbol you may use Alt Codes - https://www.alt-codes.net/ | ||
+ | |||
+ | ==== At the end === | ||
+ | |||
+ | * Your project manager will give you login ang password to our cloud server. | ||
+ | |||
+ | * Go to folder (//your language// - **Ready images**) and upload all ready images there. | ||
+ | |||
+ | * To estimate your work please use this [[https://nm.oho2.com/index.php/f/21320|tool]]. About once a week you should send the logs to me, and I will create a Purchase Order. Then you may send your invoice to us. | ||
- | will take you to find the first folder of the ES_test \\ files | ||
- | Select the first folder which is Document \\ | ||
- | Then click on the second Browse \\ | ||
- | Select the same folder that is Document \\ | ||
- | Press OK \\ | ||
- | Go to File \\ | ||
- | Open \\ | ||
- | will take you to this window \\ | ||
- | \\ | ||
- | Find this subfolder that says Document and press OK \\ | ||
- | The image opens \\ | ||
- | You start to do the exam, that is to say to frame all the lines that have text and follow the \\ | ||
- | indications of the guides. For this exam guide yourself with the guide \\ | ||
- | Handwriting_Tier1_DevTestLabeling_Criteria_v1.0 without ignoring some indications of the other \\ | ||
- | guide, so it is important to read both. \\ | ||
- | \\ | ||
- | This is how it should look once it is finished. \\ | ||
- | \\ | ||
- | Make sure the tags on the right are in Handwriting, Spanish, and by the rules other tags. \\ | ||
- | Always deselecting ENGLISH. \\ |