How does knowledge emerge from data?
What new tools are available to manipulate and interpret data?
"Informatics"the study of the structure of information itself, rather than of what information conveysis a growing trend in analytics that has given rise to everything from bioinformatics to legal informatics. Although it may be at the heart of hypothesis generation one day, computer analysis of information is still immature, and so informatics is still a tool rather than a driver of science. Nevertheless, it's providing scientists with new ways to collaborate, chart the shape of knowledge, and connect dots in their thinking. It also helps scientists identify the "most promising items to read, work on, and work with," says informaticist Katy Börner.
By aggregating information from users across the world on all life forms, the online Encyclopedia of Life project hopes to map vectors of human disease, reveal mysteries behind longevity, and foster strategies to slow invasive species. Essentially, EoL will be a microscope in reverseor “macroscope”helping scientists everywhere to discern large-scale patterns.
Katy Börner, associate professor of Information Science
and head of the Information Visualization Laboratory at Indiana University
Why is informatics important for science and innovation?
Informatics supports decision-making. In many cases people have to deal with vast amounts of information, streaming toward them in real time. They have to pick the most promising elements, interconnect them, and contribute back to this stream of knowledge and innovation. Informatics research is ultimately the study of interfaces to databases. It helps scientists analyze and communicate raw data. Information science is also integral to how we build the economy which is now more and more knowledge-oriented and innovation-driven.
Could the tools of informatics and visualization supplant old methods of analysis?
The new field of data mining or network science is far away from the rigor and maturity of statistics. I don't know how many in my field truly understand the tools they use. Nobody except the inventors for many of these toolssuch as Google's search systemknows the exact algorithm used. But how these tools work, how we access all that we collectively know, will impact our thinking and how we use knowledge. It's mysterious, yet we all still use it. As for visualization, some people take the time to understand what it's doing; they can understand vastly new ways of manipulating and depicting information.
What are the major challenges facing your field?
For the mapping of science specifically, I think we need more standards. It would be good to create an infrastructure confederating different databases, with commonly agreed-upon algorithms and tools to be used across the field. Proprietary data and proprietary tools make it very hard to compare different efforts. For the field of network science or data visualization, we should try to create a "macroscope." We need a powerful tool to reveal patterns, trends, and outliers in the streams of data flooding us, a tool to help us gain insight into the data. Because for computer science and informatics in general, there is the human element. In many cases, we build hardware and software and believe people will use itif we have built it, they will comebut I think in many cases you really have to first do some major social-technical engineering, which is a major challenge. Interviewed by TJ Kelleher
The Fundamentals: Informatics
Posted November 20, 2008
Originally appeared in Seed 19