Enterprise AI For Dummies. Zachary Jarvinen
Чтение книги онлайн.
Читать онлайн книгу Enterprise AI For Dummies - Zachary Jarvinen страница 14
For example, to approve a loan application, a loan officer considers income, age, net worth, and many other factors. Each attribute of the application is a rule or factor that the officer must evaluate to approve or reject the loan. Machine-learning techniques follow a similar process, comparing various attributes, historical decisions, and the outcome of similar applicants to estimate the credit worthiness of the new applicant. Table 1-3 shows how machine learning is like a recipe.
Prediction
In the prediction stage, the system uses the model to process new data (not historical data), detect patterns and trends, and attempt to match them to patterns from the learning data.
TABLE 1-3 Machine Learning as a Recipe
Machine Learning | Recipe | |
Task | An algorithm is a step-by-step instruction set or formula for solving a problem or completing a task. | Thaw the chicken. Season the chicken. Bake the chicken at 350°F. |
Objective | Minimize errors (loss function) to attain the best approach to solve a task. | Minimize the number of ingredients and steps required to prepare a tasty dish. |
Insight/result | The algorithm learns from errors, finds the best approach, and generates insights and rules used to make predictions. | Learn from your mistakes the next time you attempt the recipe. |
For example, if you process a brochure for the San Diego Zoo using the model, it would recognize the content about elephants and add the tag “elephant” to the document along with a score. The result is a prediction in the form of the percentage probability that the document contains information about elephants. Basically, the model makes a data-driven guess.
In AI and data science, execution is not just implementing a plan. The methodology establishes an iterative process of learning, discovering, and then acting based on new information as opposed to a more traditional IT model of formulating a plan or idea and then rolling it out as planned.
Auto-classification
Auto-classification is a machine-learning technique that automates tedious, error-prone tasks such as classifying information for storage and retrieval or answering a question. In a world where the amount of information stored digitally is expected to double every two years well into the next decade, auto-classification makes the difference between using that information and being overwhelmed by it.
Auto-classification uses two machine-learning methods, supervised classification and unsupervised classification, for two different purposes.
Supervised classification
Machine learning via supervised classification uses exemplars of known document types to classify new documents in a two-step process:
1 Train the algorithm using known, manually classified content.
2 Classify new content using the trained algorithm.
In a stable content environment, AI teams use supervised classification to set custom classification models specific to a particular application or organization. This method requires human intervention to select the training data and optimize the model, and thus requires substantial involvement and effort in the early phases of the project, but yields predictable, accurate results.
Unsupervised classification
Machine learning via unsupervised classification uses clustering and association algorithms to discover relationships in a heterogeneous dataset:
Clustering algorithms identify commonalities in the data, such as textual content or data format, and extrapolate relationships to create natural groupings and detect anomalous elements, such as security threats or medical issues.
Association algorithms reveal interesting relationships in the data to answer questions to address issues such as reducing customer churn or selecting related products for a promotion.
AI teams use unsupervised classification when attempting to answer these types of questions:
Is there any evidence of fraud in these financial transactions?
Are there any network performance symptoms that indicate a latent issue that would increase the risk of network failure?
Are there any anomalies in customer activity that point to possible buying trends?
Predictive analysis
Predictive analysis uses data mining, machine learning, and predictive modeling to process transactional and historical data to identify trends that indicate areas of increased risk or reward.
Specifically, predictive modelling software uses known results from existing data to train the model to predict relationships and outcomes that are likely to occur in future data and recommend a course of action. It is a business function, not a math problem or a science exercise.
AI teams use predictive analytics when attempting to answer these types of questions:
Will my customer purchase product X?
Will my customer like a recommended song?
Which of my customers are likely to switch to a competitor or cancel their contract?
Of all recently submitted claims, which ones are likely to require an additional fraud investigation unit review?
Is this applicant likely to default on their car loan in the future?
Because predictive analytics delivers actionable insight, in-depth knowledge in the business domain is as important as an understanding of the various analytics techniques or the ability to code analytics solutions.
For example, predictive analytics can spot buying trends and patterns, but it takes someone with an understanding of the market to help the software interpret them and assess their relevance.
Predictive analysis is used in a wide range of markets:
Manufacturing and logistics operations apply predictive maintenance to ensure maximum performance and uptime for their assets.
Financial services and retail organizations use predictive analytics