Conceptual Questions
1
.
How can you measure the effectiveness of data governance in an organization, and can you quantify it?
2
.
What is the relationship between data governance and data quality?
3
.
Should the age of data be a factor in the quality of data?
4
.
We typically use the “pyramid of knowledge” to relate to the various level of abstractions encountered when managing data. These levels are known (going from top to bottom) as data, information, knowledge, and wisdom. Can you explain why we use a pyramid to represent the levels of abstraction? Give a specific example of a pyramid starting with actual data. Can you explain the thinking process that makes it possible to go from one level to another?
5
.
Why would it be easier to scale unstructured data than structured data?
6
.
Use a free trial version of ErWin to put together a universe of discourse (UoD) for a conceptual relational model involving the following entities: EMPLOYEE, DEPARTMENT, PROJECT, WORKS_ON.
7
.
Both NFNFs and ODBs allow nested queries. Can you explain why ODBs are better at supporting deeply nested queries? (Hint: think about object identity.)
8
.
Draw a simple star schema for a problem involving product sales according to time, region, and price.
9
.
Explain the use of MapReduce on the simple example that consists of counting words in a very large document.
10
.
How can informatics be used to solve problems?