Search the OCDB using the GUI

All data that the submitters have agreed to publish are searchable by the public.

The OCDB WebUI offers a graphical search interface. The main feature of this interface is the search text field.

Lucene Syntax

The search field accepts the so-called Lucene syntax, which enables you to search for strings and substrings as well as for ranges in specific metadata headers (see the list below).

A concise description of the full Lucene query language syntax can be found here. Please note that the OCDB system does not support the complete syntax.

General syntax:

[metadata_header]: [search_term]

Example (Exact match):

investigators: Colleen

Returns all datasets where the field “investigators” exactly matches the term “Colleen”.

Wild Card:

Lucene syntax offers two wildcards: “*” matches any number of characters, and “?” matches exactly one character.

Note: You cannot use a * or ? symbol as the first character of a search.

The first example below returns all datasets whose investigators field contains a word starting with “Coll”, followed by any number of characters, whereas the second returns datasets with “Coll” followed by two arbitrary characters and ‘n’.

investigators: Coll*
investigators: Coll??n
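
The wildcard rules can be tried out locally. The following minimal sketch uses Python's standard fnmatch module, whose “*” and “?” have the same meaning, to emulate the two patterns above. It only illustrates the matching rules and is not how the OCDB server evaluates a query; the investigator names are made up for illustration.

```python
from fnmatch import fnmatchcase

# Hypothetical investigator names, used only for illustration.
names = ["Colleen", "Collin", "Collins", "Nicole"]

def matching(pattern, candidates):
    """Return the candidates that match a pattern using '*' and '?'."""
    return [c for c in candidates if fnmatchcase(c, pattern)]

# '*' stands for any number of characters after "Coll".
starts_with_coll = matching("Coll*", names)
# '??' stands for exactly two characters between "Coll" and "n".
coll_two_chars_n = matching("Coll??n", names)

print(starts_with_coll)   # ['Colleen', 'Collin', 'Collins']
print(coll_two_chars_n)   # ['Colleen']
```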

To search for any word starting with the character ‘a’ use:

a*

To search for all parameters starting with ‘a’ use:

fields: ,a*

Keep in mind that the value for the metadata header fields is a comma-separated list of parameter names.

Please note:

  • words starting with a digit must be written in quotes

  • words containing wildcards must be written without quotes

  • the following special characters must be escaped by a preceding backslash if not written in quotes:

"+ - && || ! ( ) { } [ ] ^ " ~ * ? : \"

Examples:

\-999.0
missing: "-999.0" 
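
When search terms are generated by a script, the escaping rule can be applied automatically. The sketch below is a hypothetical helper (escape_term is not part of any OCDB tool) that puts a backslash in front of each special character listed above. It escapes ‘&’ and ‘|’ individually, which is an assumption that also covers the two-character operators.

```python
import re

# Characters listed above; '&' and '|' are escaped one by one,
# which also covers the two-character operators '&&' and '||'.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\&|])')

def escape_term(term):
    """Prefix every special character of an unquoted term with a backslash."""
    return _SPECIAL.sub(r"\\\1", term)

print(escape_term("-999.0"))   # \-999.0
```

Terms that contain wildcards on purpose should not be passed through such a helper, because the wildcards would be escaped as well.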

Operators AND/OR:

These operators allow you to combine conditions. As expected, “AND” implements a logical AND and “OR” implements a logical OR.

Please note:

  • The operators AND and OR must be written in upper case.

investigators: Colleen* AND start_date: '2016-04-01'
investigators: Colleen* OR investigators: Helge*
fields: ,chl_a* OR fields: ,sza*

Operator TO to search for ranges:

Because all values are compared as strings (see the notes further down), range searches on numeric values require that the start and end values have the same number of characters. This is always the case for dates.

received: ["20191104" TO "20191108"]
start_date: ["19900101" TO "20211231"] AND end_date: ["20210101" TO "20221231"]
water_depth: ["10" TO "20"]
north_latitude: ["50" TO "60"]

The examples above list, in order, all files which:

  • have been submitted between 2019.11.04 and 2019.11.08

  • contain data in the period between 2021.01.01 and 2021.12.31

  • contain data measured at water depths between 10 and 20 meters

  • contain data at latitudes between 50 and 60 degrees north

Please note:

  • The operator TO must be written in upper case.

  • All words are treated as strings, even if they represent numeric content.

When applying the operator ‘TO’, alphanumerical comparisons are used (i. e. ‘C’ > ‘B’ is TRUE and ‘20’ < ‘9’ is TRUE as well!).
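
The effect of string comparison can be reproduced with ordinary Python strings, which are also compared character by character. The sketch below assumes plain lexicographic ordering as described above; in_string_range is a hypothetical helper that only mimics an inclusive range check and does not query the OCDB.

```python
# Strings are compared character by character.
print("C" > "B")     # True
print("20" < "9")    # True, because '2' sorts before '9'
print(20 < 9)        # False, the numeric comparison differs

def in_string_range(value, low, high):
    """Mimic an inclusive string range check such as [low TO high]."""
    return low <= value <= high

# "9" is not inside ["10" TO "20"] when compared as strings ...
nine_in_range = in_string_range("9", "10", "20")
# ... whereas "150" is, although 150 does not lie between 10 and 20.
one_fifty_in_range = in_string_range("150", "10", "20")
print(nine_in_range, one_fifty_in_range)   # False True

# Values of equal length, such as dates, behave as expected.
print(in_string_range("20191106", "20191104", "20191108"))   # True
```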

The following fields can be considered:

  • path: Path where data files are stored

  • received: Date when data were received (optional)

  • identifier_product_doi: Product DOI (conditional, if available)

  • investigators: Primary Investigators (PIs) of the experiment

  • affiliations: Affiliations of the PIs (see path)

  • contact: Contact (email address) of the PIs

  • experiment: Identifier of the experiment (see path)

  • cruise: Identifier of the cruise (see path)

  • station: Name of the station where data were obtained (conditional, i. e. required if station does not appear in fields)

  • data_file_name: Data file name

  • data_type: Data type (e.g. scan, cast, above_water, …) (mandatory)

  • data_status: Could be preliminary, update or final (optional but recommended)

  • start_date, end_date: Start and end date

  • start_time, end_time: Start and end time

  • north_latitude, south_latitude, west_longitude, east_longitude: Bounding box coordinates

  • water_depth: Water bottom depth at measurement point (in meters) (mandatory)

  • measurement_depth: Measurement/Sample depth (in meters) (conditional)

  • secchi_depth: Secchi depth (in meters) (optional but recommended)

  • missing: Fill value for invalid data (non-zero, common choice -9999)

  • below_detection_limit: Numeric NULL value for values below detection limit (optional but recommended, common choice -8888)

  • above_detection_limit: Numeric NULL value for values above detection limit (optional but recommended, common choice -7777)

  • delimiter: Delimiter of data file e.g. ‘tab’, ‘comma’ or ‘space’ (e. g. ‘delimiter: comma’)

Examples:

path: "My_Affiliation/My_experiment/My_cruise"
station: "Blyth_NOAH"
start_date: "20160429"
start_time: "17:04:16 [GMT]"
north_latitude: "61.134032 [DEG]"
missing: "-999.0" 

Note that some of the metadata headers in the list above are not mandatory, so searches on these headers may not return every relevant dataset.

Search examples

Products (Parameter)

  1. Products can be chosen from a select list within the advanced search dialog. However, valid search results can only be obtained for products without a postfix such as a wavelength.

  2. For postfixed products such as ‘Rrs400’ or ‘SZA1020’, use the search text field. All product names have to be followed by ‘*’ or ‘?’:

fields: SZA* OR fields: Chl_a*
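
For longer product lists, the query string can be assembled programmatically and then pasted into the search text field. A minimal sketch follows; product_query is a hypothetical helper, not part of the OCDB tools.

```python
def product_query(products):
    """Join product names into a 'fields:' query, one wildcard term each."""
    return " OR ".join(f"fields: {name}*" for name in products)

query = product_query(["SZA", "Chl_a"])
print(query)   # fields: SZA* OR fields: Chl_a*
```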

Product groups

The web-based search interface allows you to restrict result sets to certain geophysical variable types, organised by groups. They can be chosen from a select list. A list of groups and the variables covered is given in the table below (Status: 2026-01-30). Single product acronyms are fully described in OCDB standard field names and units.

Product group | Products
Absorbance | A578_A434, abs, abs*, abs_ad, abs_ag, abs_ap, abs_blank_ad, abs_blank_ag, abs_blank_ap, abs_nacl, abs_zero
Absorption | a, a*ph, a*srfa, ad, ad_model, ad_unc, adg, ag, ag_model, agp, ap, ap_unc, aph, aph_model, aph_unc, asrfa, aw, S_ad, S_ag
Atmosphere | aaer, angstrom, AOT, At, C2H3N_H, C2H4O_H, C2H6S_H, C3H6O_H, C5H8_H, C6H6_H, CH4O_H, CH4S_H, cloud, dewpoint, DMSA, iso_C2H3N_H, iso_C2H4O_H, iso_C2H6S_H, iso_C3H6O_H, iso_CH4O_H, no2ot, o3ot, Oz, pressure_atm, rot, total_NO2, Total_precipitable_water, wdir, wind, wvot, Wvp
Beam Attenuation | c, cg, cgp, cnw, cp, cp_gamma, cp_gamma_RMSE, cw
Carbon | DIC, DIC_L, DOC, DOC_L, fecalpellet_production_carbon_carbonweight, fecalpellet_production_carbon_dryweight, PC, PIC, PIM, POC, POC_cp, TOC, TOC_L, Total_C, VOCair
CDOM | cdmf, cdmf_counts, cdom, cdomf
CMECS | biotic_class, biotic_community, biotic_group, biotic_setting, biotic_subclass, geoform, geoform_origin, geoform_physiographic_setting, geoform_tectonic_setting, geoform_type, substrate_class, substrate_group, substrate_origin, substrate_subclass, substrate_subgroup, water_column_biogeochemical_feature, water_column_hydroform, water_column_hydroform_class, water_column_hydroform_type, water_column_layer, water_column_salinity, water_column_temperature
Depth | depth, water_depth, Z_90, Z_DCM, Z_Eu, Z_MLD, Z_XML
Diffuse Attenuation | Kd, Kl, Knf, Kpar, Ku
Dissolved gases | 13CO2_amps, amplitude, AOU, AOU_kg, Ar_amps, CO2_amps, DMSSW, H2O_amps, N2_amps, N2_Ar_ratio, N2_fix, N2_O2_ratio, NH4, NO2, NO2_NO3, NO3, O2_amps, O2_Ar_ratio, oxygen, oxygen_kg, oxygen_phase, oxygen_saturation, pCO2, PO4, TDN, TDN_kg
Flow cytometry | FSC-A, FSC-H, FSCpar-H, FSCperp-H, MUF, pulse_width, SSC-A, SSC-C
Fluorescence | F-initial, FL-A, FL-H, Fm, Fv_Fm, natf, Rf, stimf, stimf_counts, TauAv
Flux | flux_ATN, flux_Ba, flux_bSi, flux_calibration_method, flux_mass, flux_P, flux_particles, flux_Pb_210, flux_PC, flux_PC_ATN, flux_PIC, flux_PN, flux_Po_210, flux_POC, flux_sampling_method, flux_sinking_speed_method, flux_submethod, flux_Th_234
Generic | ratio
Genetic | DNA, RNA, rrna_gene
Geographic | altitude, lat, lat_end, lat_recovery, lat_resurface, lat_start, lon, lon_end, lon_recovery, lon_resurface, lon_start
Geometric | heading, nadir, pitch, pitch_Ed, pitch_Es, pitch_Lu, RelAz, roll, roll_Ed, roll_Es, roll_Lu, SAZ, sensor_azimuth, SenZ, SZA, tilt, tilt_Ed, tilt_Es, tilt_Lu
Granulometry | bin_depth, bin_diameter_center, bin_diameter_lower, bin_diameter_upper, bincount, PSD, PSD_DNSD, PSD_DVSD
Grazing | g, g_herb
Hydrogeomorphology | benthic_type
Ice | MSV
Identifier | R2R_Event
Instrumental | It, pressure_instrument, SN
Irradiance | E_scalar, Ed, Ed_scalar, EdGND, EdPAR_scalar, Elw, Epar, EPAR_scalar, Es, EsGND, Esky, Esun, Esw, Eu, Eu_scalar, EuGND, EuPAR, EuPAR_scalar, F0
Isotopic | Ba_138_134_d_delta
Metals | conc_Ba_total, conc_particles, conc_Pb_210, conc_Po_210, conc_Th_234, conc_U_238
Minerals | BSi, LSi, SiO4
Nutrients | aa_alanine, aa_arginine, aa_aspartic_acid, aa_beta-alanine, aa_gamma_amino_butyric_acid, aa_glutamic_acid, aa_glycine, aa_histidine, aa_isoleucine, aa_leucine, aa_lysine, aa_methionine, aa_phenylalanine, aa_serine, aa_taurine, aa_threonine, aa_tyrosine, aa_valine, AMC, AMC-Leu, DCAA, MUF-But, MUF-Glu, MUF-PO4, Urea
PAR | PAR
Particulate | PN, PN_mmol, POH, POM, PON, PTP, sinking_velocity, SPM, TEP_bottle, TEP_MSC
Photosynthesis | sigma_PSII
Physical Oceanography | cond, N2avg, pH, pressure, sal, sigma_theta, sigmaT, SST, total_alkalinity, VelEast, VelNorth, VelUp, waveht, Wt
Pigments | Allo, alpha-beta-Car, Anth, Asta, BChl_a, beta-beta-Car, beta-epi-Car, beta-psi-Car, But-fuco, Cantha, Chl, Chl_a, Chl_a_allom, Chl_a_prime, Chl_b, Chl_c, Chl_c1, Chl_c1c2, Chl_c2, Chl_c3, Chl_experiment, Chl_lineheight, Chl_stimf, Chlide_a, Chlide_b, chors_id, Croco, Diadchr, Diadino, Diato, Dino, DP, DV_Chl_a, DV_Chl_b, Echin, epi-epi-Car, Fuco, Gyro, Hex-fuco, Hex-kfuco, hpl_id, hplc_gsfc_id, length_representation, Lyco, Me-chlide_a, Me-chlide_b, Mg_DVP, Monado, MV_Chl_a, MV_Chl_b, Neo, P-457, Perid, PHAEO, Phide_a, Phide_b, Phide_c, Phycocyanin, Phytin_a, Phytin_b, Phytin_c, Phytyl-chl_c, PPC, PPC_Tcar, PPC_Tpg, Pras, PSC, PSC_Tcar, PSP, PSP_Tpg, Pyrophide_a, Pyrophytin_a, Pyrophytin_b, Pyrophytin_c, Siphn, Siphx, Tacc, Tacc_Tchla, Tcar, Tchl, TChl_Tcar, Tchla_Tpg, Tot_Chl_a, Tot_Chl_b, Tot_Chl_c, Tpg, Vauch, Viola, Zea
Plankton | 3H_Leu, 3H_Leu_L, abun, abun_bacterioplankton, abun_phyto, abun_zoop, abundance, area_based_diameter, area_cross_section, bactP, biomass_microzoop, biovol_bacterioplankton, biovolume, equivalent_spherical_diameter, Et-8-carot, Et-chlide_a, Et-chlide_b, mPF, nanoeukaryote, nPF, phyto_carbon, picoeukaryote, pPF, prochlorococcus, RelAbundance, scientificName_automated, scientificName_manual, scientificNameID_automated, scientificNameID_manual, synechococcus, u, u_ph, u_zoo, weight_dry, weight_protein_community, weight_protein_sample, weight_wet, width_representation, zoop_biomass_dry, zoop_biomass_wet
Primary Productivity | GPP, NPP, PP, rate_13C_uptake_bottle, rate_15N_uptake_bottle, rate_diff_02_dark_incub, rate_diff_C_dark_incub, rate_diff_O2_corrcoef, rate_nitrification, rate_production_DOC_inc, rate_respiration_C_ind, rate_respiration_carbon_zoop, rate_respiration_O2_ind
Productivity | NCCb, NCP_O2_Ar_ratio_continuous, NCPb, oxygen_consumption_temcorr
Quality | quality
Radiance | Lsky, Lt, Lu, LuGND, Lut, Lw, Lw_unc, Lwn, Lwnex
Reflectance | Q, R, Rb, rho, Rl, Rpi, Rrs, Rrs_unc
Sampling | bottle, cast, data_provider_category_automated, data_provider_category_manual, net_interval, profile, pvel, sample, station, station_alt_id, volfilt, volume
Scattering | b, bb, bb_counts, bbp, bbp_bp, bbp_gamma, bbw, bp, bw, nrb, Sv, VSF, VSFg, VSFp, VSFw
Secchi | secchi_depth
Spectrometry | cycle, mz, pDrift, Tdrift
Sunphotometry | sig
Time | date, date_end, date_processed, date_recovery, date_resurface, date_start, day, elapsed_time, hour, incubation_duration, minute, month, sdy, second, time, time_end, time_processed, time_recovery, time_resurface, time_start, year
Transmission | trans
Water clarity | SZ, turbidity

For a more detailed and up-to-date overview table, please refer to the OCDB homepage and choose Overview in the main menu. There, you will find the number of SEABASS datasets provided for each product.

For a detailed list of SEABASS parameter names see: https://seabass.gsfc.nasa.gov/wiki/stdfields.

Time range

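To select a time period covered by the data files, combine the metadata headers start_date and end_date. A data file covers at least part of a period if it starts no later than the end of the period and ends no earlier than its start, so both conditions must hold. The following query searches for data at least partly covering 1 January to 31 December 2021:

start_date: ["19000101" TO "20211231"] AND end_date: ["20210101" TO "20990101"]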
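
A query of this kind can also be generated for an arbitrary period. The sketch below is a hypothetical helper (time_overlap_query is not part of the OCDB tools) that formats the dates as eight-digit strings, so that both ends of each range have the same length. It joins the two conditions with AND, because a data file overlaps a period only if it starts no later than the period's end and ends no earlier than its start. The outer bounds "19000101" and "20990101" are taken from the example above.

```python
from datetime import date

def time_overlap_query(period_start, period_end):
    """Build a query for data files overlapping the given period."""
    earliest, latest = "19000101", "20990101"   # outer bounds from the example
    first = period_start.strftime("%Y%m%d")
    last = period_end.strftime("%Y%m%d")
    return (f'start_date: ["{earliest}" TO "{last}"] AND '
            f'end_date: ["{first}" TO "{latest}"]')

query_2021 = time_overlap_query(date(2021, 1, 1), date(2021, 12, 31))
print(query_2021)
```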

Region

  1. You can use the interactive map to select a region by a rectangle or a polygon.

  2. You can use the Python API or the OCDB command line interface to search for datasets by defining a certain region (see OCDB Command Line Client and Python API).