- Hands-On Predictive Analytics with Python
- Alvaro Fuentes
- 214字
- 2025-04-04 15:18:27
CRISP-DM and other approaches
Another popular framework for doing predictive analytics is the cross-industry standard process for data mining, most commonly known by its acronym, CRISP-DM, which is very similar to what we just described. This methodology is described in Wirth, R. & Hipp, J. (2000). In this methodology, the process is broken into six major phases, shown in the following diagram. The authors clarify that the sequence of the phases is not strict; although the arrows indicate the most frequent relationships between phases, those depend on the particularities of the project or the problem being solved. These are the phases of a predictive analytics project in this methodology:
- Business understanding
- Data understanding
- Data preparation
- Modeling
- Evaluation
- Deployment

There are other ways to look at this process; for example, R. Peng (2016) describes the process using the concept of Epicycles of Data Analysis. For him, the epicycles are the following:
- Develop expectations
- Collect data
- Match expectations with the data
- State a question
- Exploratory data analysis
- Model building
- Interpretation
- Communication
The word epicycle is used to communicate the fact that these stages are interconnected and that they form part of a bigger wheel that is the data analysis process.