Information visualization

Max points
25

Test

Max points
25

Report

Max points
50

Project: Linked scatterplots

Example data: car.csv. The first row of the file contains the column names (data attributes). Each subsequent row contains one data record. Values are separated by commas (","), and the decimal separator is a period (".").
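
The file format described above could be read roughly as follows. This is a minimal sketch using Python's standard csv module; the function name load_records and the sample column names are illustrative assumptions, since the actual columns of car.csv are not listed here.

```python
import csv
import io

def load_records(text):
    """Parse CSV text: first row = attribute names, later rows = records."""
    reader = csv.reader(io.StringIO(text), delimiter=",")
    header = [name.strip() for name in next(reader)]
    records = []
    for row in reader:
        if not row:                      # skip blank lines
            continue
        record = {}
        for name, cell in zip(header, row):
            cell = cell.strip()
            try:
                record[name] = float(cell)   # "." is the decimal point
            except ValueError:
                record[name] = cell          # keep non-numeric cells as text
        records.append(record)
    return header, records

# Hypothetical sample in the described format (not the real car.csv).
sample = "name,mpg,weight\nalpha,18.5,3504\nbeta,24.0,2372\n"
header, records = load_records(sample)
```

For a real file, the text would come from open(path, newline="").read(), or the reader could be wrapped around the open file object directly.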

The mandatory features of the program (25 points total):

  • user can import data from CSV file
  • user can open multiple scatterplot windows
  • user can specify for each window which data attributes are mapped to X and Y axes.
  • user can perform rectangular brush in a window
  • selected records are highlighted over all windows
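
The brushing and linking requirements above can be sketched independently of any GUI toolkit. In this sketch a brush is a rectangle in data coordinates, and a shared selection object notifies every window when the selection changes. The names brush_select and SelectionModel, and the sample attributes, are assumptions made for illustration.

```python
def brush_select(records, x_attr, y_attr, x_min, x_max, y_min, y_max):
    """Return indices of records inside the rectangle (bounds inclusive)."""
    lo_x, hi_x = sorted((x_min, x_max))   # tolerate a brush dragged right-to-left
    lo_y, hi_y = sorted((y_min, y_max))
    return {
        i for i, r in enumerate(records)
        if lo_x <= r[x_attr] <= hi_x and lo_y <= r[y_attr] <= hi_y
    }

class SelectionModel:
    """Shared selection; each scatterplot window registers a redraw callback."""

    def __init__(self):
        self.selected = set()
        self._listeners = []

    def subscribe(self, callback):
        self._listeners.append(callback)

    def set_selection(self, indices):
        self.selected = set(indices)
        for callback in self._listeners:   # every window re-highlights
            callback(self.selected)

# Hypothetical records; real ones would come from the imported CSV file.
records = [
    {"mpg": 18.0, "weight": 3500.0, "hp": 130.0},
    {"mpg": 26.0, "weight": 2200.0, "hp": 90.0},
    {"mpg": 31.0, "weight": 1900.0, "hp": 65.0},
]
model = SelectionModel()
seen = []
model.subscribe(lambda sel: seen.append(("window A", sorted(sel))))
model.subscribe(lambda sel: seen.append(("window B", sorted(sel))))
model.set_selection(brush_select(records, "mpg", "weight", 25, 35, 1800, 2500))
```

Storing the selection as record indices rather than screen positions lets windows with different X/Y mappings highlight the same records.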

Optional features (25 points total):

  • additional rectangular brushes can be added with AND or OR operator (10 points)
  • user can adjust the extents of every brush (i.e. the sides of the rectangle) and the selection is re-calculated (15 points)
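
One way to combine several brushes is to fold them from left to right, applying each brush's operator to the result so far. The assignment does not specify operator precedence, so the left-to-right order here is an assumption, as are the Brush class and its field names.

```python
from dataclasses import dataclass

@dataclass
class Brush:
    x_attr: str
    y_attr: str
    x_min: float
    x_max: float
    y_min: float
    y_max: float
    op: str = "OR"   # how this brush combines with the result so far

    def hits(self, records):
        return {
            i for i, r in enumerate(records)
            if self.x_min <= r[self.x_attr] <= self.x_max
            and self.y_min <= r[self.y_attr] <= self.y_max
        }

def combine(brushes, records):
    """Fold brushes left to right; the first brush's operator is ignored."""
    selection = set()
    for k, brush in enumerate(brushes):
        hits = brush.hits(records)
        if k == 0:
            selection = hits
        elif brush.op == "AND":
            selection &= hits
        elif brush.op == "OR":
            selection |= hits
        else:
            raise ValueError(f"unknown operator: {brush.op!r}")
    return selection

# Hypothetical records and brushes.
records = [
    {"mpg": 18.0, "weight": 3500.0, "hp": 130.0},
    {"mpg": 26.0, "weight": 2200.0, "hp": 90.0},
    {"mpg": 31.0, "weight": 1900.0, "hp": 65.0},
]
b1 = Brush("mpg", "weight", 25.0, 35.0, 1800.0, 2500.0)
b2 = Brush("mpg", "hp", 0.0, 40.0, 80.0, 200.0, op="AND")
both = combine([b1, b2], records)

b2.op = "OR"
either = combine([b1, b2], records)

# Adjusting an extent only changes the brush; the selection is recomputed.
b1.x_min = 30.0
b2.op = "AND"
adjusted = combine([b1, b2], records)
```

Because the selection is always recomputed from the current list of brushes, dragging a side of any rectangle needs no special handling beyond calling combine again.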

Max points
50

Project: Parallel coordinates

Example data: car.csv. The first row of the file contains the column names (data attributes). Each subsequent row contains one data record. Values are separated by commas (","), and the decimal separator is a period (".").

The mandatory features of the program (25 points total):

  • user can import data from CSV file
  • program contains one window with parallel coordinates plot, initially with all the data attributes from the data file.
  • axes have names and min/max values displayed in the plot
  • axes can be deleted, added and the order of the axes can be changed.
  • axis orientation can be flipped (i.e. max is down, min is up)
  • user can select data by brushing an interval on an axis, selected data are highlighted
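
The axis handling listed above might be modelled as below. Each axis stores its name and min/max for display, maps values to a unit height (0 is the bottom of the axis), and can be flipped. The Axis class and helper names are illustrative assumptions, and the sample attributes are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Axis:
    name: str
    vmin: float
    vmax: float
    flipped: bool = False

    def to_unit(self, value):
        """Map a data value to [0, 1], where 0 is the bottom of the axis."""
        if self.vmax == self.vmin:
            return 0.5                      # constant attribute: draw mid-axis
        t = (value - self.vmin) / (self.vmax - self.vmin)
        return 1.0 - t if self.flipped else t

def make_axes(names, records):
    """One axis per attribute, with min/max taken from the data."""
    return [
        Axis(n, min(r[n] for r in records), max(r[n] for r in records))
        for n in names
    ]

def polyline(record, axes):
    """One (axis position, unit height) vertex per axis, in display order."""
    return [(i, ax.to_unit(record[ax.name])) for i, ax in enumerate(axes)]

def brush_interval(records, attr, lo, hi):
    """Indices of records whose value on attr lies in [lo, hi]."""
    return {i for i, r in enumerate(records) if lo <= r[attr] <= hi}

records = [
    {"mpg": 18.0, "weight": 3500.0},
    {"mpg": 26.0, "weight": 2200.0},
    {"mpg": 31.0, "weight": 1900.0},
]
axes = make_axes(["mpg", "weight"], records)
line = polyline(records[0], axes)

axes[1].flipped = True                      # max is now down, min is up
flipped_line = polyline(records[0], axes)

selected = brush_interval(records, "mpg", 20.0, 30.0)
```

Since axes is a plain list, deleting, adding and reordering axes reduce to list operations (pop, insert, swapping elements). The brush is kept in data values, so flipping an axis does not change which records are selected.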

Optional features (25 points total):

  • additional interval brushes can be added on any axis with AND or OR operator (10 points)
  • user can adjust the extents of every brush (i.e. the min/max values of the interval) and the selection is re-calculated (15 points)

Max points
50

Project: Theme river

Download baby name data from: http://www.ssa.gov/oact/babynames/names.zip

Implement a theme river visualization of the popularity of baby names over the years in the data.

Mandatory program features (20 points):

  • Filter the data by sex
  • Additional filter: top 10 names of each year, or top 10 names overall
  • Zoom the view to a certain time interval
  • Theme river graph uses polygonal shapes
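
The filters and the polygonal streams could be computed along these lines. The layout of the files inside names.zip is not described here, so the sketch starts from counts already loaded into memory for one sex, as a dictionary from name to a list of counts per year. All function names and the sample data are assumptions, and "top 10 per each year" is read as "names that reach the top 10 in at least one year", which is only one possible interpretation.

```python
def top_n_overall(counts, n=10):
    """Names with the largest total count over all years."""
    return sorted(counts, key=lambda name: sum(counts[name]), reverse=True)[:n]

def top_n_per_year(counts, n=10):
    """Names that are among the top n in at least one year."""
    num_years = len(next(iter(counts.values())))
    keep = set()
    for y in range(num_years):
        ranked = sorted(counts, key=lambda name: counts[name][y], reverse=True)
        keep.update(ranked[:n])
    return keep

def zoom(counts, years, first, last):
    """Restrict counts and years to the interval [first, last]."""
    idx = [i for i, y in enumerate(years) if first <= y <= last]
    return {n: [c[i] for i in idx] for n, c in counts.items()}, [years[i] for i in idx]

def stream_polygons(counts, names, years):
    """Stack streams around a centred baseline; one closed polygon per name."""
    totals = [sum(counts[n][y] for n in names) for y in range(len(years))]
    lower = [-t / 2 for t in totals]          # symmetric "river" baseline
    polygons = {}
    for n in names:
        upper = [lo + counts[n][y] for y, lo in enumerate(lower)]
        bottom = list(zip(years, lower))
        top = list(zip(years, upper))
        polygons[n] = bottom + top[::-1]      # bottom left-to-right, top back
        lower = upper                         # next stream sits on this one
    return polygons

# Hypothetical counts, not taken from the real data set.
years = [2000, 2001, 2002]
counts = {"Ann": [4, 2, 6], "Bob": [2, 2, 2], "Cy": [1, 5, 0]}
overall = top_n_overall(counts, n=2)
per_year = top_n_per_year(counts, n=1)
polygons = stream_polygons(counts, overall, years)
zoomed_counts, zoomed_years = zoom(counts, years, 2001, 2002)
```

Each polygon is a list of (year, height) vertices in data coordinates; mapping them to screen coordinates is left to the drawing code.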

Optional program features (30 points):

  • Theme river graph uses spline/curve shapes (10 points)
  • Name can be selected by clicking on its 'stream' and the stream is highlighted. (10 points)
  • Display name on mouse hover over the stream. (10 points)
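
For clicking and hovering, the program needs to find which stream lies under the mouse. A minimal sketch, assuming the mouse position has already been converted to data coordinates and that stream boundaries are linearly interpolated between years (exact for polygonal streams, only approximate for spline shapes). The function stream_at and the boundaries layout are illustrative assumptions.

```python
from bisect import bisect_right

def stream_at(x, y, years, names, boundaries):
    """Return the name whose stream contains (x, y), or None.

    boundaries[k][i] is the k-th stacked boundary at years[i];
    stream names[k] lies between boundaries[k] and boundaries[k + 1].
    Requires at least two years.
    """
    if not years[0] <= x <= years[-1]:
        return None
    i = min(bisect_right(years, x) - 1, len(years) - 2)    # segment index
    t = (x - years[i]) / (years[i + 1] - years[i])
    # Height of every boundary at position x.
    edge = [b[i] + t * (b[i + 1] - b[i]) for b in boundaries]
    for k, name in enumerate(names):
        if edge[k] <= y <= edge[k + 1]:
            return name
    return None

# Hypothetical stacked boundaries for two streams over three years.
years = [2000, 2001, 2002]
names = ["Ann", "Bob"]
boundaries = [[-3, -2, -4], [1, 0, 2], [3, 2, 4]]

hover = stream_at(2000.5, 0.0, years, names, boundaries)
click = stream_at(2001, 1.0, years, names, boundaries)
outside = stream_at(2002, 5.0, years, names, boundaries)
```

The same lookup can serve both features: on hover the returned name is displayed, on click it is stored as the highlighted stream.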

Max points
50

Project: Clustering with radial tree

Example data: car.csv. The first row of the file contains the column names (data attributes). Each subsequent row contains one data record. Values are separated by commas (","), and the decimal separator is a period (".").

The mandatory features of the program (40 points total):

  • user can import data from CSV file
  • user can specify which data attributes are used for the similarity (distance) calculation
  • program performs hierarchical clustering using the nearest-neighbor (single-linkage) algorithm
  • program displays the hierarchy in the form of a node-link diagram with radial tree layout (without text labels)
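
The clustering step can be sketched as a naive agglomerative loop: repeatedly merge the two clusters whose closest members are nearest to each other. Euclidean distance, the nested-tuple output and the function names are assumptions of this sketch, and whether attributes should be normalized first is not specified in the assignment. The loop is written for clarity rather than speed.

```python
import math

def to_points(records, attrs):
    """Build points from the attributes chosen for the distance calculation."""
    return [tuple(r[a] for a in attrs) for r in records]

def single_linkage(points):
    """Return the hierarchy as nested tuples of point indices (needs >= 1 point)."""
    clusters = {i: [i] for i in range(len(points))}   # cluster id -> members
    trees = {i: i for i in range(len(points))}        # cluster id -> subtree
    next_id = len(points)
    while len(clusters) > 1:
        best = None
        ids = sorted(clusters)
        for pos, a in enumerate(ids):
            for b in ids[pos + 1:]:
                # Single linkage: distance between the closest pair of members.
                d = min(math.dist(points[p], points[q])
                        for p in clusters[a] for q in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[next_id] = clusters.pop(a) + clusters.pop(b)
        trees[next_id] = (trees.pop(a), trees.pop(b))
        next_id += 1
    return next(iter(trees.values()))

# Hypothetical records with two well-separated groups.
records = [
    {"x": 0.0, "y": 0.0, "label": "a"},
    {"x": 0.0, "y": 1.0, "label": "b"},
    {"x": 5.0, "y": 5.0, "label": "c"},
    {"x": 5.0, "y": 6.5, "label": "d"},
]
points = to_points(records, ["x", "y"])
tree = single_linkage(points)
```

Leaves of the resulting tree are record indices, so any attribute of a record (for example a name) can be looked up later when the tree is drawn.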

Optional feature (10 points):

  • user can specify which data attribute is used as name and the names of leaves are displayed as text labels of the leaf nodes in the diagram (10 points)
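
A simple radial tree layout gives every subtree an angular wedge proportional to its number of leaves and places each node on a circle whose radius equals its depth. The sketch below assumes the hierarchy is stored as nested tuples with record indices as leaves; that representation, the function name and the labels argument are illustrative assumptions.

```python
import math

def radial_layout(tree, labels=None):
    """Return (nodes, edges) for a nested-tuple tree.

    nodes: dicts with x, y, depth, angle and label (None for inner nodes).
    edges: (parent index, child index) pairs into nodes.
    """
    def count_leaves(t):
        return sum(count_leaves(c) for c in t) if isinstance(t, tuple) else 1

    total = count_leaves(tree)
    nodes, edges = [], []

    def place(t, depth, start, parent):
        span = count_leaves(t)
        angle = 2 * math.pi * (start + span / 2) / total   # middle of the wedge
        is_leaf = not isinstance(t, tuple)
        nodes.append({
            "x": depth * math.cos(angle),
            "y": depth * math.sin(angle),
            "depth": depth,
            "angle": angle,
            "label": labels[t] if (is_leaf and labels is not None) else None,
        })
        me = len(nodes) - 1
        if parent is not None:
            edges.append((parent, me))
        if not is_leaf:
            offset = start
            for child in t:
                place(child, depth + 1, offset, me)
                offset += count_leaves(child)

    place(tree, 0, 0, None)
    return nodes, edges

# Hypothetical hierarchy over four records and their name attribute.
tree = ((0, 1), (2, 3))
names = ["a", "b", "c", "d"]
nodes, edges = radial_layout(tree, labels=names)
leaf_labels = [n["label"] for n in nodes if n["label"] is not None]
```

The stored angle can be used to rotate or align each text label so that it points away from the centre. The layout is recursive, so a very deep hierarchy may require an iterative variant.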