Which of the following OCR (Optical Character Recognition) engines is not free of charge?
A. Tesseract.
B. Microsoft Azure OCR.
C. OmniPage.
D. Microsoft OCR.
Which of the following is a best practice when choosing a UiPath ML (Machine Learning) Extractor?
A. The popularity of the ML Extractor among other UiPath users should be the primary factor when choosing a UiPath ML Extractor. Opt for the ML Extractor that has the highest number of downloads or positive reviews.
B. Consider the document types, language, and data quality when choosing an ML Extractor. It is important to select one that is specifically trained or optimized for the document types being processed. It is also important to take into account the quality and diversity of the training data used to train the ML Extractor to ensure accurate and reliable extraction results.
C. The cost of the ML Extractor should be the main consideration when choosing an ML Extractor. Select the ML Extractor that offers the lowest price, regardless of its performance or suitability for the specific document understanding needs.
D. The size of the ML Extractor is the most important factor to consider when choosing an ML Extractor. Bigger models always perform better and provide more accurate extraction results because the development team invested time and effort into creating the algorithm, which in turn will result in better performance for the trained model.
Which UiPath Communications Mining model performance factor assesses the proportion of the entire dataset that has informative label predictions?
A. Average label performance.
B. Coverage.
C. Balance.
D. Underperforming labels.
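The quantity this question describes, the share of a dataset that has at least one informative label prediction, can be illustrated with a minimal sketch. The function name `informative_share`, the input shape, and the rule that a prediction counts as informative when its score meets a fixed `threshold` are all assumptions made for illustration; this is not how UiPath Communications Mining computes its own figure.

```python
from typing import Dict, List


def informative_share(
    predictions: List[Dict[str, float]], threshold: float = 0.5
) -> float:
    """Fraction of messages with at least one label scored at or above threshold.

    Each item in `predictions` maps label names to confidence scores for one
    message. The threshold rule is a simplifying assumption.
    """
    if not predictions:
        return 0.0
    covered = sum(
        1
        for labels in predictions
        if any(score >= threshold for score in labels.values())
    )
    return covered / len(predictions)


# Hypothetical predictions for four messages.
sample = [
    {"Billing": 0.9},
    {"Billing": 0.2, "Refund": 0.1},
    {},
    {"Refund": 0.5},
]
share = informative_share(sample)
```

In this made-up sample, two of the four messages have a prediction at or above the threshold, so the share is 0.5.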
What does adding missed labels help improve in UiPath Communications Mining?
A. Label bias warnings.
B. Data security.
C. Taxonomy coverage.
D. Label precision and recall.
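For readers unfamiliar with the terms in option D, precision and recall for a single label follow the standard definitions sketched below. The function name and the counts in the example are hypothetical and are not taken from any UiPath API or dataset.

```python
from typing import Tuple


def precision_recall(
    true_positives: int, false_positives: int, false_negatives: int
) -> Tuple[float, float]:
    """Standard per-label precision and recall from raw counts."""
    predicted = true_positives + false_positives  # everything the model labelled
    actual = true_positives + false_negatives     # everything that should be labelled
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / actual if actual else 0.0
    return precision, recall


# Hypothetical counts for one label.
precision, recall = precision_recall(
    true_positives=8, false_positives=2, false_negatives=8
)
```

With these counts, precision is 8 / 10 and recall is 8 / 16.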
In an analytics-focused model in UiPath Communications Mining, which factor determines the extent to which a model trainer should optimize their model's performance?
A. The trainer's discretion, based on use case objectives and capacity to continue training.
B. There are at least 200 labels with at least a 70% MAP (Mean Average Precision).
C. There are at least 150 labels with at least an 80% MAP.
D. The proportion of data that has been annotated is at least 10% of the total dataset volume.
When creating a training dataset, what is the recommended number of samples for a Column field?
A. 10-20 document samples per column field.
B. 20-50 document samples per column field.
C. 50-200 document samples per column field.
D. 100-200 document samples per column field.
What is an out-of-the-box Machine Learning Package?
A. A newly created ML Package.
B. An uploaded ML Package from the local machine as a .zip file with all the necessary files and metadata needed to train and serve a machine learning model.
C. An ML Package that is reused.
D. An ML Package created by UiPath or the Open Source community.
Which activity from the UiPath.IntelligentOCR.Activities Package allows you to retrieve text from PDF or image files?
A. Data Extraction Scope activity.
B. Present Classification Station activity.
C. Classify Document activity.
D. Digitize Document activity.
What can the Semantic Similarity out-of-the-box model be used for?
A. Understand sentiment in product reviews, customer surveys, social media posts, and emails.
B. Relate customer questions to FAQ documents and automatically pull responses from these documents.
C. Classify text in resumes, emails, web pages, and other formats.
D. Extract and classify text in emails, letters, web pages, research papers, and call transcripts.
What is the page unit cost per extracted page for the ML Extractor?
A. 0
B. 0.2
C. 0.5
D. 1