Monday, July 13, 2009

July 2009 Visualisation ... 7 Key Challenges we face

The Digital Humanities Observatory is running a week long series of workshops on TEI, XSLT, Data Modelling and Data Visulisation from Monday the 13th to Friday the 17th of July. As part of this I was invited to present a lecture to all the workshop participants. I entitled my talk "Visualisation as an analytical tool, from networks to data streams. 7 Key Challenges we face." Michael Maguire gave a very flattering description of my talk on his blog (thanks).

As I said during my talk, I normally give such a talk as an introductory session to information visualisation or visual analytics. However, this time I structured my talk around what I see as the 7 key challenges we (or anyone interested in visualising data) face. This blog post is a summary of the 110+ slides I presented (sans examples and mathematics!).

The ideas I presented are my view on the world of information visulisation and visual analytics. The key challenges were not presented in order of importance (as their relative importance is problem or domain dependent). There are also a number of challenges I personally feel (including multi-device and small screen visulisation) are crucial but I realise are not as pressing as the mainstream issues people face.

My ideas are informed by my ongoing research in InfoVis and from keynotes, lectures, online talks, toolkits and blogs that I've read or seen. Useful (and insightful) sources include, the visualizeit blog, the infosethetics blog by Andrew in the University of Sydney, the keynote Peter Eades gave at InfoVis 2006, the keynote Christian Chabot gave at the IEEE VAST 2008 and the ideas I could glean online from the VisWeek 2008 Panel on Grand Challenges for Information Visualization. If I've missed anything you feel is important do let me know!

So the 7 key challenges I see include:
  1. Empower: We must ensure the person using visualisation to understand data is empowered to gain insight or save time etc. To achieve this focus (long and hard) on identifying the questions that you need to answer with your visulisation. Do not just think about the data. If you think you have tool, method or technique to help empower a person (yourself or another) to gain insight or save time, can you validate this? What validation methods can you employ to ensure you are not just toying with pretty pictures?
  2. Connect: Ensure, based on the question at hand, you help the person using the visualisation build a connection between the data and any processing/analysis and the visual form presented. The question at hand and hence data drives what is an appropriate visualisation. Also, if you are using a particular visual form (eg. maps) how far can you stretch the metaphor or connection between data and display, before it breaks?
  3. Volume: Ensure if the data needed to help answer the question at hand has many elements that your visualisation method, tool or technique can support this. Voluminous datasets can break many desktop tools simply due to the time/memory/bandwidth needed to "load" the dataset. There are many sources of data with numerous individual elements to consider, 304,059,724 people in the USA (sources US Census Bureau) data on age, gender, ethnicity, household make up, home structure, income, farms, business and sales available. In July 2008 Google found 1 trillion (1,000,000,000,000) unique URLs on the web at once. This is ever increasing with user generated and automatically created content. One of our recent studies on extracting social networks from non-social network data started with 9,468,460 one-way flight passenger records. Clearly there are large datasets one might be faced with. Another problem (often overstated) is the dimensionality of the data (each element having multiple attributes to consider).
  4. Heterogeneity: Ensure if the data needed to help answer the question at hand consists of heterogenous data from multiple different sources or of “variant types” that your visualisation method, tool or technique can support this. If you need to consider a heterogenous data space then ensure the data-sets interlock so coupled or co-ordinated views are meaningful (and possible to display).
  5. Audience:Suit the word (display) to the audience. Ensure you match the visualisations to your questions and your audience. Know your user and don’t explore visualisation questions in a bubble. Engage and explore! Some methods, tools and techniques do not suit particular audiences. "You haven’t made impact with visual analytics until you help people with their own data" and I would add to this "in the particular sociotechnical context where they will use your tools, 
methods or techniques".
  6. Dynamism: Data isn’t static. Ensure if the data needed to help answer the question at hand is a live source or the display is expected highlight changes over time that your visualisation method, tool or technique can support this.
  7. Discovery: Discover the new world once!: Ensure that your tools can store and capture and automate the process of pattern identification for subsequent data exploration. Convert identified patterns into “alerts” or stepwise mining, analysis, query and refinement into workflow.
As this was a masterclass I went on to point out the 10hr - 100,000hr guidelines to move from Trainee to Mastering visualisation. Gladwell spoke of the 10,000hr rule in his book Outliers which is important to consider when being introduced to a new topic like this. I pointed this out so people could help benchmark their own knowledge and skill level. My talk contained an introduction to cartography and GIS, multi-dimensional visualisation, parallel coordinates, paired parallel coordinates, graph drawing, force directed layouts and treemaps. As I'm currently learning iPhone application development myself, I know how dangerous and presumptive one can become moving from "Trainee" to "Apprentice" levels of knowledge. As they say, a little knowledge is a dangerous thing! I very much enjoyed giving this lecture to this DHO masterclass. I also gained some great insights with the director of the DHO Susan Schreibman on her experiences being a director there. Thanks to Shawn Day and Paolo Battino for inviting me to come along.

Saturday, July 11, 2009

July 2009 Upcoming Conferences of interest.

My upcoming move has me thinking about conferences in areas of interest to the HITLab Australia.

The International Symposium on Wearable Computers 2009 will be held in Sept in Linz Austria. I aim to attend ISWC 2009 as well as visiting Vodafone research in Munich enroute to meet some colleagues and one of my students undertaking an internship there.

The Eighth International Conference on Pervasive Computing 2009 will be held in May in Helsinki Finland. Along with my role as workshop co-chair I'm planning with some of my students and colleagues to submit some papers. Pervasive is one the premier events showcasing state of the art research in Pervasive Computing. It's a very good event to attend both to understand the developments within our field but also to engage the entire research community through workshops, demos, posters etc.

Along with colleagues I ran a Workshop on designing multi-touch interaction techniques for coupled public and private displays at AVI 2008 in Naples. AVI 2010 the biannual 10th International Working Conference on Advanced Visual Interfaces will be held in Rome in May 2010. I intend to run another workshop with colleagues to follow up on PPD08 along with submitting research papers based on our current and ongoing research.

Due to prior travel commitments one conference I cannot attend but would like to is this year's International Symposium on Mixed and Augmented Reality (ISMAR 2009) in Florida from October 19-23. I do however hope to attend ISMAR 2010 in Korea.

Thursday, July 02, 2009

July 2009 Elevation to BCS Fellow

The British Computer Society has informed me that I am now a Fellow of the BCS, specifically a chartered fellow (FBCS CITP) of the BCS. Thanks to Professor Dobson for being my supporter on this.