- Modern Big Data Processing with Hadoop
- V. Naresh Kumar Prashant Shindgikar
- 156字
- 2025-04-04 17:12:20
Data structure analysis
Once data is acquired, we have to understand the structure of the data. Remember that the data we are getting can be in any of the following forms:
- Text data:
- Structured data
- Unstructured data
- Binary data
This is where we need certain tools to help us understand the structure of the data.
Once we have a thorough understanding of the data we are dealing with, the next task is to understand the bits and pieces we need to extract from this structure. Sometimes, depending on the complexity and size of the data we are dealing with, it might take time for us to really find and extract the information we are looking for.
Once we know what we are looking for and also have a solid understanding of the structure of the data, it becomes easier for us to come up with simple algorithms to extract the required information from the input data.