Profit Driven Business Analytics. Baesens Bart
Чтение книги онлайн.
Читать онлайн книгу Profit Driven Business Analytics - Baesens Bart страница 8
a. Customer lifetime value estimation is an example of classification.
b. Demand estimation is an example of classification.
c. Customer churn prediction concerns regression.
d. Detecting fraudulent credit-card transactions concerns classification.
Question 4
Which is not a characteristic of a good data scientist? A good data scientist:
a. Has a solid business understanding.
b. Is creative.
c. Has thorough knowledge on legal aspects of applying analytics.
d. Excels in communication and visualization of results.
Question 5
Which statement is true?
a. All analytical models are profit-driven when applied in a business setting.
b. Only predictive analytics are profit-driven, whereas descriptive analytics are not.
c. There is a difference between analyzing data for the purpose of explaining or predicting.
d. Descriptive analytics aims to explain what is observed, whereas predictive analytics aims to predict as accurately as possible.
Question 1
Discuss the difference between a statistical perspective and a business perspective toward analytics.
Question 2
Discuss the difference between modeling to explain and to predict.
Question 3
List and discuss the key characteristics of an analytical model.
Question 4
List and discuss the ideal characteristics and skills of a data scientist.
Question 5
Draw the analytics process model and briefly discuss the subsequent steps.
REFERENCES
Agrawal, R., and R. Srikant. 1994, September. “Fast algorithms for mining association rules.” In Proceedings of the 20th international conference on very large data bases, VLDB (Volume 1215, pp. 487–499).
Athanassopoulos, A. 2000. “Customer Satisfaction Cues to Support Market Segmentation and Explain Switching Behavior.” Journal of Business Research 47 (3): 191–207.
Baesens, B. 2014. Analytics in a Big Data World: The Essential Guide to Data Science and Its Applications. Hoboken, NJ: John Wiley and Sons.
Baesens, B., V. Van Vlasselaer, W. Verbeke. 2015. Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques: A Guide to Data Science for Fraud Detection. Hoboken, NJ: John Wiley and Sons.
Bhattacharya, C. B. 1998. “When Customers Are Members: Customer Retention in Paid Membership Contexts.” Journal of the Academy of Marketing Science 26 (1): 31–44.
Breiman, L. 2001. “Statistical Modeling: The Two Cultures.” Statistical Science 16 (3): 199–215.
Cao, B. 2016. “Future Healthy Life Expectancy among Older Adults in the US: A Forecast Based on Cohort Smoking and Obesity History.” Population Health Metrics, 14 (1), 1–14.
Chakraborty, G., P. Murali, and G. Satish. 2013. Text Mining and Analysis: Practical Methods, Examples, and Case Studies Using SAS. SAS Institute.
Coussement, K. 2014. “Improving Customer Retention Management through Cost-Sensitive Learning.” European Journal of Marketing 48 (3/4): 477–495.
Dejaeger, K., W. Verbeke, D.Martens, and B. Baesens. 2012. “Data Mining Techniques for Software Effort Estimation: A Comparative Study.” IEEE Transactions on Software Engineering 38: 375–397.
Elder IV, J., and H. Thomas. 2012. Practical Text Mining and Statistical Analysis for Non-Structured Text Data Applications. Cambridge, MA: Academic Press.
Han, J., and M. Kamber. 2011. Data Mining: Concepts and Techniques. Amsterdam: Elsevier.
Hand, D. J., H. Mannila, and P. Smyth. 2001. Principles of Data Mining. Cambridge, MA: MIT Press.
Hyndman, R. J., A. B. Koehler, J. K. Ord, and R. D. Snyder. 2008. “Forecasting with Exponential Smoothing.” Springer Series in Statistics, 1–356.
Peto, R., G. Whitlock, and P. Jha. 2010. “Effects of Obesity and Smoking on U.S. Life Expectancy.” The New England Journal of Medicine 362 (9): 855–857.
Shmueli, G., and O. R. Koppius. 2011. “Predictive Analytics in Information Systems Research.” MIS Quarterly 35 (3): 553–572.
Tan, P. – N., M. Steinbach, and V. Kumar. 2005. Introduction to Data Mining. Reading, MA: Addison Wesley.
Van Gestel, T., and B. Baesens. 2009. Credit Risk Management: Basic Concepts: Financial Risk Components, Rating Analysis, Models, Economic and Regulatory Capital. Oxford: Oxford University Press.
Verbeke, W., D. Martens, and B. Baesens. 2014. “Social Network Analysis for Customer Churn Prediction.” Applied Soft Computing 14: 431–446.
Verbraken, T., C. Bravo, R. Weber, and B. Baesens. 2014. “Development and Application of Consumer Credit Scoring Models Using Profit-Based Classification Measures.” European Journal of Operational Research 238 (2): 505–513.
Widodo, A., and B. S. Yang. 2011. “Machine Health Prognostics Using Survival Probability and Support Vector Machine.” Expert Systems with Applications 38 (7): 8430–8437.
CHAPTER 2
Analytical Techniques
INTRODUCTION
Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90 % of the data in the world has been created in the last two years. These massive amounts of data yield an unprecedented treasure of internal knowledge, ready to be analyzed using state-of-the-art analytical techniques to better understand and exploit behavior about, for example, your customers or employees by identifying new business opportunities together with new strategies. In this chapter, we zoom into analytical techniques. As such, the chapter provides the backbone for all other subsequent chapters. We build on the analytics process model reviewed in the introductory chapter to structure the discussions in this chapter and start by highlighting a number of key activities that take place during data preprocessing. Next, the data analysis stage is elaborated. We turn our attention to predictive analytics and discuss linear regression, logistic regression, decision trees, neural networks, and random forests. A subsequent section elaborates on descriptive analytics such as association rules, sequence rules and clustering. Survival analysis techniques are also discussed, where the aim is to predict the timing of events instead of only event occurrence. The chapter concludes by