iVia Notes on General Visualization, Graph Visualization or Interface:

Tools:

JUNG ****

the Java Universal Network/Graph Framework--is a software library that provides a common and extendible language for the modeling, analysis, and visualization of data that can be represented as a graph or network. The JUNG architecture is designed to support a variety of representations of entities and their relations, such as directed and undirected graphs, multi-modal graphs, graphs with parallel edges, and hypergraphs. It provides a mechanism for annotating graphs, entities, and relations with metadata. The current distribution of JUNG includes implementations of a number of algorithms from graph theory, data mining, and social network analysis, such as routines for clustering, decomposition, optimization, random graph generation, statistical analysis, and calculation of network distances, flows, and importance measures (centrality, PageRank, HITS, etc.). Also see InfoVis Toolkit. http://jung.sourceforge.net/

Tulip ****

For visualizing very large graphs using node-link diagrams... cope. with hundreds of thousands of nodes. This supports many diverse visualizations and is open source.
http://www.tulip-software.org

SPIRE and IN-SPIRE ****

http://in-spire.pnl.gov/about.html
free to fed gov projects ($500 fee) visualized crawls; supports exploration of textual data, including Boolean and .topical. queries, term gisting, and time/trend analysis tools; interactive up to 30k-60k 1 pages docs (might = bib rec + desc); UNIX and Windows flavors; pretty impressive; zoom (galaxy > topic terrain); reads ASCII or XML (we can convert). Pretty sophisticated. Beautiful. May be too experimental. Free trial. Taxonomy-Based Classification and Visualization of Text
http://www.emsl.pnl.gov/docs/cis/annual_report1999/1558b-3c.html

Starlight ****

http://starlight.pnl.gov/
U.S. Gov or sponsored org. We would qualify. Go to applications then go to Web maps whereupon we get a nice visualization of a focused crawl accompanied by the following: The image above shows a Starlight Network View representation of the "Information Visualization" Web community. This collection of Web pages was harvested using the Google search engine to retrieve the top 100 ranked pages containing the phrase "information visualization," as well as all pages that link to, or are linked to from, the pages in the query result set. The entire collection consists of approximately 1500 pages. In this view, nodes represent discrete Web pages, and edges represent hyperlink references among the pages. The pages are color-encoded according to their link class, with pages in the original result set colored green, result set inlinks colored blue, and result set outlinks colored yellow. Note that the majority of the result set pages are embedded in the clique (dodecahedra) and cluster (stellated dodecahedra) nodes. ........ Starlight is currently only available to U.S. Government and Government-sponsored organizations. If your organization meets this qualification, please contact us for licensing and availability information. Be advised that, as Starlight users, you and your colleagues will be leaving terra firma behind, and joining us out on the edge of possibility. Or something like that.

InfoVis Toolkit ****

http://www.lri.fr/~fekete/InfovisToolkit
The InfoVis Toolkit, as of version 0.6, implements eight (8) types of visualization: Scatter Plots, Time Series and Parallel Coordinates for tables; Node-Link diagrams, Icicle trees and Treemaps for trees; Adjacency Matrices and Node-Link diagrams for graphs. Naomi likes this one.

From InfoVis author Jean-Daniel Fekete <Jean-Daniel.Fekete@inria.fr> :

"I received inspiration for my data structure from LEDA graphs and the C++ BOOST Graph Library. They don't use "objects" for storing graphs but simpler containers. I just used their architecture and generalized it to tables, trees and graphs. The performance and memory footprint is much better this way than using objects for edges and vertices. Any number of attributes can be associated with vertices and edges in that way, and these attributes can be fetched very quickly and in a type-safe manner. I haven't looked at your implementation of attributes associated with edges and vertices but the trivial implementation using a vector of objects is both very slow and very memory intensive (I have made some comparisons in a tech report http://www.inria.fr/rrrt/rr-4818.html).

As for actual numbers, the Toolkit can work with million of vertices and edges but the visualization is slow above 10,000 items. This will be greatly improved with Java 1.5 when the OpenGL-based implementation of Graphics2D is available. We have implemented such a beast at the Univ. of Maryland (it is called Agile2D http://www.cs.umd.edu/hcil/agile2d/index.shtml) but Sun is currently working on a standard cross-platform implementation that will provide the speed required to visualize hundred of thousands of items at interactive speed I guess. Currently, the InfoVis Toolkit relies on executable programs to do the graph layouts (the GraphViz programs: dot, neato and twopi). Using Jung, it could be done entirely inside the toolkit with greater control. So far, the toolkit is distributed under the Q Public License, which is like the GPL but forbids forks so I can control the development until I feel it is sufficiently stable to be left alone. I would then turn it to a BSD-like license.

Don't forget that the Toolkit is a work in progress. However, it is used by some quite heavy packages and seem to stand the load so I am quite confident about its structure and extensibility. I think we should try to merge our efforts if possible to avoid fragmentation and focus on innovation and improvements instead of repetition."

Agile2D (based on InfoVis)

http://www.cs.umd.edu/hcil/agile2d/index.shtml

InfoVis CyberInfrastructure ****

http://iv.slis.indiana.edu/
This web site provides access to different software packages easing the exploration, modification, comparison, and extension of data mining and information visualization algorithms. Diverse software packages were bundled into learning modules. Links to diverse databases, compute resources, and references are provided as well. This is a compendium of resources/papers/software of value in getting an overview.

Treemap ****

http://www.cs.umd.edu/hcil/treemap Treemap is a space-constrained visualization of hierarchical structures. It is very effective in showing attributes of leaf nodes using size and color coding. Treemap enables users to compare nodes and sub-trees even at varying depth in the tree, and help them spot patterns and exceptions. Open for usage by higher ed. Not sure if it will scale to a crawl visualization.

Pajek

http://vlado.fmf.uni-lj.si/pub/networks/pajek/
large network analysis.compared with JUNG often

Walrus:

Walrus is a tool for interactively visualizing large directed graphs in three-dimensional space. By employing a fisheye-like distortion, it provides a display that simultaneously shows local detail and the global context. Walrus is a tool for interactively visualizing large directed graphs in three-dimensional space. It is technically possible to display graphs containing a million nodes or more, but visual clutter, occlusion, and other factors can diminish the effectiveness of Walrus as the number of nodes, or the degree of their connectivity, increases. Thus, in practice, Walrus is best suited to visualizing moderately sized graphs that are nearly trees. A graph with a few hundred thousand nodes and only a slightly greater number of links is likely to be comfortable to work with. http://www.caida.org/tools/visualization/walrus/

H3Viewer

http://graphics.stanford.edu/~munzner/h3/
The H3Viewer library provides layout and interactive navigation of node-link graphs in 3D hyperbolic space. The library can handle quite large graphs of over to 300,000 edges quickly and with minimal visual clutter. The hyperbolic view allows the user to see a great deal of the context around the current focus node. Walrus is the Java3D version.

GGobi

http://www.ggobi.org/
GGobi is a data visualization system for viewing high-dimensional data and is the next edition of xgobi. It provides a new interface to many of the features of xgobi, built using Gtk, the GIMP toolkit.

OpenDX !!!

http://opendx.org/

VisAD

VisAD is a Java component library for interactive and collaborative visualization and analysis of numerical data. The name VisAD is an acronym for "Visualization for Algorithm Development".
http://www.ssec.wisc.edu/~billh/visad.html

Graphviz

http://www.research.att.com/sw/tools/graphviz/examples/
http://www.graphviz.org/
Undirected graphs -- intranet layout example

Open Source Graph or Network Visualization Written in Java

http://www.manageability.org/blog/stuff/open-source-graph-network-visualization-in-java/view
several packages

Graph Interface Library: GINY

http://csbi.sourceforge.net/index.html open source

Visualization ToolKit (VTK)

http://www.vtk.org/index.php

Piccolo

http://www.cs.umd.edu/hcil/jazz/

Info Viz Toolkit

http://www.lri.fr/~fekete/InfovisToolkit/

TANAGRA

produces decision graphs and trees. Includes several classification methods. (Win). Shareware http://eric.univ-lyon2.fr/~ricco/tanagra/en/tanagra.html

LEDA ****

http://www.algorithmic-solutions.com/enleda.htm Library of efficient data types and algorithms (LEDA) contains all of the relevant algorithm building blocks in an easy-to-use and efficient form dealing with objects such as graphs, sequences, dictionaries, trees, points, flows, matchings, segments, and shortest paths.

GDToolkit

http://www.dia.uniroma3.it/~gdt/

AGD

http://www.ads.tuwien.ac.at/AGD/
A LEDA-based library of C++ classes for graph drawing.


Link Lists:

Information Visualization Resources on the Web http://graphics.stanford.edu/courses/cs348c-96-fall/resources.html
(also see InfoVis Cyberstructure above for overview of research/software)


Papers:

Topical clustering, summarization and visualization http://www-nlp.stanford.edu/courses/cs224n/2003/fp/millersj/cs224nfp.pdf
"The growing collection of academic papers available on the web provides a terrific resource for researchers. With this volume comes a need for organization, a task which CiteSeer and other sites have attempted to tackle. We set out to extend the functionality provided by CiteSeer through an investigation of clustering by topic similarity, experimenting with visualization and summary of clustering results. We had three main goals in mind. First, we hoped to develop a sensible clustering of the articles based on textual similarity. In doing so, we ended up using CiteSeer.s similarity results as partial input to our clusterer. Second, we hoped to provide a meaningful visualization of some of the topics represented in the article corpus. Finally, we wanted to investigate the possibility of automatically naming each cluster given the features present in each article and the article.s similarity to the closest centroid. With such a large pool of articles, we felt that automatic generation would be preferable to a hand-crafted taxonomy." May not scale. Some interesting notions.

Mapping topics and topic bursts in PNAS
Ketan K. Mane and Katy Börner
PNAS, April 6, 2004; 101 (Suppl. 1)
http://www.pnas.org/content/vol101/suppl_1/
Scientific research is highly dynamic. New areas of science continually evolve; others gain or lose importance, merge, or split. Due to the steady increase in the number of scientific publications, it is hard to keep an overview of the structure and dynamic development of one's own field of science, much less all scientific domains. However, knowledge of "hot" topics, emergent research frontiers, or change of focus in certain areas is a critical component of resource allocation decisions in research laboratories, governmental institutions, and corporations. This paper demonstrates the utilization of Kleinberg's burst detection algorithm, co-word occurrence analysis, and graph layout techniques to generate maps that support the identification of major research topics and trends. The approach was applied to analyze and map the complete set of papers published in PNAS in the years 1982-2001. Six domain experts examined and commented on the resulting maps in an attempt to reconstruct the evolution of major research areas covered by PNAS.

Visualization of semantic metadata and ontologies. Paul Mutton and Jennifer Golbeck. In Seventh International Conference on Information Visualization (IV03), pages 300-305. IEEE, July 2003.
http://www.cs.kent.ac.uk/pubs/2003/1655/index.html


Parameters to Visualize Expert Interactions in the Crawl: Help in Lifting and Distillation

Persona: A Contextualized and Personalized Web Search
http://www.hicss.hawaii.edu/HICSS_35/HICSSpapers/PDFdocuments/DTDMI01.pdf

H, Chang, et al
Creating Customized Authority Lists
http://citeseer.ist.psu.edu/chang99creating.html

PEBL: Positive Example Based Learning for Web Page (Make Corrections) Classification
Using SVM Hwanjo Yu Department of Computer Science University...
http://citeseer.ist.psu.edu/564419.html