All scientific data collected by the Australian Antarctic program (AAp) are eventually described in the Catalogue of Australian Antarctic and Subantarctic Metadata (CAASM). CAASM can be used to search through AAp data descriptions, and it also provides links to access publicly available datasets, which can either be immediately downloaded or obtained from the Australian Antarctic Data Centre (AADC).

View the full metadata record
Van Dorst, J., Wilkins, D., Crane, SL., Montgomery, K., Zhang, E. and Ferrari, BC. (2021) Soil microbiological and chemical data from the Casey Station biopiles (numbers 1 to 5, and the control plot), covering the period 2011 to 2018., Ver. 1, Australian Antarctic Data Centre - doi:10.26179/xr6k-7n37, (Unreleased data)
Soil microbiological and chemical data from the Casey Station biopiles (numbers 1 to 5, and the control plot), covering the period 2011 to 2018.
Data Centre
Australian Antarctic Data Centre, Australia
Created Date
Revision Date
Parent record


Soil chemical parameters and microbiological data obtained from soil samples taken from biopiles 1,2,3,4 and 5, and the control “test plot” established at Casey station, Antarctica, for the purpose of the remediation of petroleum hydrocarbon contaminants present in the soil.

Analysis of inorganic soil parameters
Soil samples were analysed at the Australian Antarctic Division’s (AAD) laboratory, or analysed at one of two commercial, nationally accredited laboratories (National Association of Testing Authorities, Australia). Samples analysed at AAD for pH, EC, ammonia, nitrite, nitrate and phosphate were analysed on 1:5 soil:Milli-Q extracts (10g soil: 50ml deionised water) after being shaken end-over-end for 1 hour. Exchangeable ammonium was analysed using a 2M KCl extraction. Extractable phosphate was also measured using the Colwell P method (method 9B1 in Rayment and Lyons) with colorimetric molybdenum-blue determination. Samples submitted to external laboratories for nitrite and nitrate were analysed on 1:5 soil water extracts (10g soil: 50ml deionised water), whilst exchangeable ammonium was analysed using a 1:10 extraction with borate buffer.

The data is presented in the enclosed file nutrients_norm.csv, with column headers explained in more detail in the enclosed file nutrients_norm_codex.csv.
Analysis of hydrocarbons in soil
Field moist soil samples were collected in amber glass jars with PTFE lined lids, stored and transported at -18°C. Samples for analysis were thawed, tipped into a clean foil or glass dish and thoroughly mixed or chopped with a spatula to maximise homogeneity. Using a straight spatula with a width of 9 mm, a representative 12 g subsample was taken, excluding any large gravel greater than 8 mm across its largest dimension. The subsample was weighed into a 40 mL glass headspace vial. Hydrocarbons were extracted from homogenised wet soil by tumbling overnight for 17 hours with a mixture of 10 mL of deionised water, 10 mL of hexane, and 1 mL of hexane spiked with internal standards: 250 mg/L Bromoeicosane; 50 mg/L p-Terphenyl; 50 mg/L Tetracosane-d50 (C24D50); 50 mg/L 1,4-Dichlorobenzene; and 250 mg/L Cyclooctane. Samples were then centrifuged for up to 10 minutes at 1000 rpm. Following removal of the hexane extract, soil remaining in the vial was dried at 105 °C for 24 hours to constant weight, the dry mass recorded, and moisture content calculated. Extracts were analysed for TRH by gas chromatography using instrument conditions described in van Dorst et al (submitted).
TRH concentrations were determined using a calibration curve, generated from standard solutions of special Antarctic blend diesel (SAB). TRH was measured using the ratio of the total detector response of all hydrocarbons to the internal standard peak response of Bromoeicosane. Calibrations, instrument performance and retention time assignments were set up using standards as shown in van Dorst et al (submitted). Matlab was used to perform baseline subtraction of the hexane chromatogram profile and integrate individual identified compounds as well as summed hydrocarbon fractions. Resolved peaks distinct from the Unresolved Complex Mixture (UCM) were calculated and the reproducible baseline subtraction allowed for an evaluation of small UCM changes as fuel degradation proceeds in the field. MS Access was used to manage samples, standards, raw laboratory data and metadata, to enable the application of an automated algorithm to calculate reports. Quality control samples and standards were run in every batch.
The data is presented in the enclosed file BP_TRH_15TP.csv, with column headers explained in BP_TRH_CODEX.csv.

Microbial community DNA extraction and 16S rRNA amplicon sequencing
Total gDNA was extracted from soil samples in triplicate and quantified using the FastDNA SPIN kit for soils (MP Biomedicals, Seven Hills, NSW, Australia), according to the manufacturers protocol. Triplicates were combined, quantified with picogreen (Invitrogen, Mount Waverley, VIC, Australia) and diluted to a standard working concentration (10 ng/µl). The 16S rRNA gene was targeted with the universal primers, 28F and 519R. Amplicon sequencing was performed on the Roche 454 titanium platform and on the Illumina Miseq platform. The 454 sequence data was processed using the MOTHUR software package as described in van Dorst et al. 2014. Briefly, sequences and flowgrams were extracted from the .sff files, de-multiplexed and error-checked via the Pyronoise algorithm in MOTHUR. After further quality control screening, bacterial seed sequences were aligned to the curated SILVA secondary structure alignment. Aligned 16S sequences were then clustered into OTUs (Operational Taxonomic Unit) based on 97% sequence similarity. For illumina sequence reads, sequences were quality filtered, trimmed, and clustered denovo to pick OTUs at 97% identity. Reads were then assigned to sample-by-OTU matrices. OTUs were taxonomically classified against the SILVA v3.2.1 SSU rRNA database for both sample-by-OTU matrices. The 454 OTU matrices were merged with the illumina OTU matrices using the QIIME 2 ( feature-table merge option. Matrices were rarefied using the qiime feature-table rarefy function to generate random subsamples. The control samples were used to evaluate the run variation. Alpha and beta diversity indices were not different between 454 and illumina control samples. To run the Analysis of composition of microbes (ANCOM) and generate PCO plots the full data identified to the genera level was used from the file seq_full_PCO.csv. For generating the bubble plot, only genera representing greater than 0.1% of the total relative abundance were used from the seq_reduced.csv file.

The 16S rRNA amplicon sequencing data was utilised from the two separate files, seq_full_PCO.csv and Seq_red_bubble.csv with the consistent column headers explained in 16SSeq_CaseyBiopiles_CODEX.csv.

Microbial community gene abundances with MFQPCR
Quantification of key hydrocarbon degrading genes, nitrogen cycling genes and genes indicative of the overall bacterial and fungal load were calculated with microfluidic quantitative polymerase chain reaction (MFQPCR). Like traditional qPCR, MFQPCR is based on real-time detection of DNA-complexing fluorophores, but as a high throughput technology, it reduces manual pipetting and allows simultaneous quantification of multiple genes and samples. Primers targeting universal genes for Fungi, Bacteria, Acidobacteria and Betaproteobacteria, along with a suite of primers for nitrogen cycling and hydrocarbon degrading genes were selected from the literature according to the principals outlined in Crane et al. 2018. Standards for copy number quantification were generated through artificially synthesized gBlock Gene Fragments (Integrated DNA Technologies) using representative sequences sourced from NCBI using Primer-BLAST. Standards were serially diluted, combined and added to MFQPCR assays according to the methods in Crane et al. 2018. Assays for samples and standards were conducted in a 96.96 Fluidigm Dynamic Array™ Integrated Fluidic Circuit according the manufacturer’s Evagreen® protocol. Final primer concentrations of 700 nM were used with thermo-cycling conditions of 95°C for 1 min, 35 cycles of 96°C for 5 s and 60°C for 25 s, followed by melt curve analysis for 60-95°C at a ramp rate of 1°C/3 s. The output MFQPCR data was analysed using the Real-Time PCR Analysis software, version 4.1.2 (Fluidigm) with the default quality threshold of 0.65 and linear baseline correction. Peak sensitivity was set at 7 and a peak ratio threshold of 0.7, and melt temperature (Tm) ranges were set individually based on the peaks observed in standards, as per manufacturers’ recommendations. Individual reactions were excluded from analysis if they failed any of the melt curve quality parameters, were outside 0.5 Ct of other replicates, or had a peak outside the set Tm range. The final copy numbers were converted to copy numbers per gram of soil.

The data is presented in the enclosed file qpcr_final_figure.csv with column headers explained in qpcr_CaseyBiopiles_CODEX.csv.

Show more...


Soil samples were collected over multiple years, and soil inorganic parameters were measured at different analytical laboratories, sometimes with slightly different methods. These method differences are fully explained and accounted for in the included metadata records.


These data are not yet publicly available.

Temporal Coverages

Spatial Coverages

Science Keywords

Additional Keywords

  • RRNA






  • Automated DNA Sequencer



Use Constraints

This data set conforms to the CCBY Attribution License (

Please follow instructions listed in the citation reference provided at when using these data.

Creative Commons License