Australian Antarctic DivisionConditions of Use | Feedback & requests | About us | Site map

Australian Antarctic Data Centre

Analysis Tools

Analysis tools - main index

A suite of analysis tools utilising data from the Data Centre's repository and elsewhere.
About
Projects
Publications
Software

Data Mining

Random image
Average SeaWiFS-derived chlorophyll-a for southern hemisphere summer 1998/99 to 2004/05.

What is data mining?

Data mining:

Why are we interested in data mining?

Collecting data from polar regions is difficult and expensive. Through data mining, the Data Centre seeks to make the data that we hold more useful to the Antarctic community.

Data mining is used to enhance the value of the data held by the Data Centre in several ways:

Data Mining Projects

Current data mining projects

Past data mining projects

Publications

Request a copy of any of these publications.

Journal and conference publications

1.Williams G.D., Nicol S., Raymond B., Meiners K. (2008) Summertime mixed layer development in the marginal sea ice zone off the Mawson coast, East Antarctica Deep-Sea Research Part IIDetails
2.Lawton K., Kirkwood R., Robertson G., Raymond B. (2008) Preferred foraging areas of Heard Island albatrosses during chick raising and implications for the management of incidental mortality in fisheries Aquatic Conservation: Marine and Freshwater EcosystemsDetails
3.Martin-Smith K., O'Brien P., Raymond B., Constable A. (2007) Summary factsheets for bioregionalisation of the Southern Ocean - examples from the Indian Ocean Sector (Area 58) CCAMLR Workshop on Bioregionalisation. Brussels, Belgium, August 2007 Details
4.van den Hoff J., Burton H., Raymond B. (2007) The population trend of southern elephant seals (Mirounga leonina) at Macquarie Island (1952 - 2004) Polar BiologyDetails
5.Grant S., Raymond B. (2007) Data for bioregionalisation of the Southern Ocean CCAMLR Workshop on Bioregionalisation. Brussels, Belgium, August 2007Details
6.Rhodes M., Wardell-Johnson G.W., Rhodes M.P., Raymond B. (2006) Applying network analysis to the conservation of habitat trees in urban environments: a case study from Brisbane, Australia. Conservation BiologyDetails PDF (requires subscription)
7.Raymond B., Belbin L. (2006) Visualisation and exploration of scientific data using graphs. Data Mining, Lecture Notes in Computer ScienceDetails PDF (requires subscription)
8.Meiners K., Pasquer B., Raymond B. (2006) On the large-scale distribution of sea-ice algae off East Antarctica and the importance of sea-ice thickness and snow cover. International Workshop on Antarctic Sea Ice Thickness, Hobart, 5-7 July 2006.Details
9.Raymond B., Constable A., Sokolov S. (2006) Network approaches to marine regionalisation. Network Theory Working Group Meeting 3, CSIRO Sustainable Ecosystems, Canberra, June 2006.Details
10.Raymond B., Meiners K., Curran M., van Ommen T. (2006) A conceptual model of the large-scale distribution of sea ice algae off East Antarctica. Abstracts of the SCAR Open Science Conference, Hobart, 12-14 July 2006.Details
11.Raymond B., Constable A. (2006) Regionalisation of the Southern Ocean: A statistical framework. CCAMLR WG-EMM-06/37 Agenda Item 5 & 6.Details
12.Woehler E.J., Raymond B., Watts D.J. (2006) Convergence or divergence: where do short-tailed shearwaters forage in the Southern Ocean? Marine Ecology Progress SeriesDetails PDF
13.Grant S., Constable A., Raymond B., Doust S. (2006) Bioregionalisation of the Southern Ocean: Report of Experts Workshop (Hobart, September 2006)Details PDF and supplementary material
14.Raymond B., Hosie G., Woehler E. (2006) Structured graphs for visualisation and exploration of biodiversity data. 5th International Conference on Ecological Informatics, Santa Barbara, California, December 2006.Details
15.Burton H.R., Venegas S., Van den Hoff J., Raymond B., M Curran M. (2006) The annual numbers of Leopard Seals (Hydrurga leptonyx) sighted at Macquarie Island (over 56 years) are correlated significantly with periodic flux of sea-ice concentration south-east of the island. Abstract of the SCAR Open Science Conference, Hobart, 12-14 July 2006Details
16.Van den Hoff J., Burton H., Raymond B., Bester M. (2006) Long-term changes in the population status of Southern Elephant Seals (Mirounga leonina) at Macquarie Island, 1952-2005. Abstracts of the SCAR Open Science Conference, Hobart, 12-14 July 2006Details
17.Raymond, B., Hindell M., Worby T., Williams G., Meiners K., Hosie G., Adams N., Woehler E. (2005) Ecological Change in East Antarctica. Poster presented at International Workshop - Ecological change in East Antarctica - Stochastic Variability, Cycles or Regime Shifts? Hobart, 5-7 September 2005Details
18.Raymond B., Rhodes M., Wardell-Johnson G., Stark J. (2005) Network-based visualisation: exploring case studies of bat roost networks and benthic assemblages. Network Theory Working Group Workshop II, Canberra, March 8-9.Details PDF
19.Constable A.J., Candy S.J., Raymond B. (2005) Examination of the characteristics of the fishery for Dissostichus eleginoides in the CCAMLR statistical subarea 48.3 and its implications on estimating trends in catch per unit effort. CCAMLR WG-FSA-SAM-05/17Details
20.Constable A.J., Ball I., Raymond B., Candy S., Williams R., Dunn A. (2005) Evaluating methods to assess yield of patagonian toothfish (Dissostichus eleginoides) in CCAMLR Division 58.5.2. Details
21.Meiners K., Raymond B., Williams G., Massom R., Nicol S. (2005) A conceptual model of the large-scale distribution of sea ice algae off East Antarctica during the autumn-winter transition. Conference: Dynamic Planet, Cairns, August 22-26Details
22.Raymond B., Watts D.J., Burton H., Bonnice J. (2005) Data Mining and Scientific Data. Arctic, Antarctic, and Alpine ResearchDetails Abstract
23.Cunningham L., Raymond B., Snape I., Riddle M.J. (2005) Benthic diatom communities as indicators of anthropogenic metal contamination at Casey Station, Antarctica Journal of PaleolimnologyDetails PDF (requires subscription)
24.Raymond B., Belbin L. (2004) Visualisation and exploration of scientific data using graphs. Proceedings of the Third Australasian Data Mining Conference, December 2004, Canberra, AustraliaDetails PDF
25.Emmerson L., Raymond B., Southwell C. (2004) Modelling availability bias using existing time series count data: Adélie penguins as a case study. CCAMLR WG-EMM-04/54. Agenda Item No 6.1Details Abstract PDF
26.Raymond B., Belbin L., Stark J. (2004) Graphical methods for the exploration of ecological databases. 2004 Meeting of the Ecological Society of Australia, Adelaide, December 2004Details Abstract
27.Raymond B. (2004) Data mining: making the most of polar and oceanographic information in the 21st century. Proceedings of the 30th Annual Conference of the International Association of Aquatic and Marine Science Libraries and Information Centers, Hobart, September 2004Details Abstract
28.Raymond B., Woehler E.J. (2003) Predicting seabirds at sea in the Southern Indian Ocean. Marine Ecology Progress SeriesDetails PDF
Supplementary material
29.Woehler E.J., Raymond B., Watts D.J. (2003) Decadal-scale seabird assemblages in Prydz Bay, East Antarctica. Marine Ecology Progress Series.Details
30.Raymond B., Woehler E.J. (2002) Mining Antarctic scientific data: a case study Proceedings Australasian Data Mining Workshop 3rd December 2002, Canberra, AustraliaDetails PDF
31.Woehler E., Raymond B., Watts D. (2002) Long-term study analyses seabird communities. Australian Antarctic Magazine 4, Spring 2002Details PDF

Draft or in-press publications

  1. Raymond, B. and Hosie, G. (2008) Network-based exploration and visualisation of ecological data. Submitted
  2. Schwarz, J.N., Raymond, B., Williams, G., Marsland, S., Pasquer, B., Mongin, M., and Wright, S. (2008) Climatological anomalies in wind, sea surface temperature, sea-ice and chlorophyll concentrations during the BROKE-West survey. Submitted
  3. Woehler, E.J., Raymond, B., Boyle, A., and Stafford, A. (2008) The role of environmental determinants on seabird assemblages observed during BROKE West, January - March 2006. Submitted

Software

The Polar Toolbox is a collection of Matlab functions for various analyses of Antarctic data, but which might be useful for analyses of other data. Most of this code has been tested under Matlab 2008a. It may also be compatible with other applications, such as Octave, which is a freely-available, mostly-Matlab-compatible package.

All code here is experimental and made freely available. Before downloading you will need to register (free!) with the Australian Antarctic Data Centre. This allows us to direct our development attention to the most active files.

Download the toolbox

Queries?

Toolbox contents

% Polar Toolbox: A Matlab toolbox for various Antarctic analytical tasks
% Ben Raymond 
% May 2008
%
% aloc - Belbin's ALOC non-hierarchical clustering algorithm
% cellmax - the maxima of a cell vector of identically-sized matrices
% cellmean - the mean of a cell vector of identically-sized matrices
% cellstd - the SD of a cell vector of identically-sized matrices
% cluster_clara - clustering using R's CLARA function. Requires R and the matlabRlink toolbox to be installed
% cutree - cuts a dendrogram and return the group membership vector 
% dump_csv - write numeric matrix to comma separated text file
% easegrid_ll2rc - transforms latitude and longitude coordinates to row and column coordinates of an EASE ice data grid cell
% gebco_colormap - colormap similar to that used on GEBCO bathymetric charts
% get_ice - retrieves ice data from binary files as obtained from NSIDC/seaice.de/etc
% get_icemotion - retrieves ice motion data from binary files as obtained from NSIDC
% icemotion_easegrid_ll2rc - converts longitude/latitude coordinates to row/column coordinates in the ice motion EASE grid
% ndvi_colormap - colormap similar to that used to show normalised vegatation index images
% proplim - confidence interval of a proportion m/N
% ranks - ranks of x adjusted for ties
% slidingmean - applies a sliding mean to a vector
% slidingmedian - applies a sliding median to a vector
% spearman - computes Spearman's rank-order correlation coefficient for two vectors x and y
% sunriseset - calculates sunrise and sunset times