Re: [galeon] The GALEON wiki and Use Cases

To: Ben Domenico <Ben@xxxxxxxxxxxxxxxx>
Subject: Re: [galeon] The GALEON wiki and Use Cases
From: Roy Mendelssohn <Roy.Mendelssohn@xxxxxxxx>
Date: Sat, 11 Oct 2008 11:19:23 -0700

Hi Ben:

Assume the simpler case for the second scenario. I have gridded dataat depth, and I want all the data in the top 200 meters for a givenbounding box and parameter. The animal case is not quite theequivalent of the sensor - for the sensor it is a single sensor whosetrack I am storing. For the animal, I have a vector of is locationsand depth, and say I have a 4-D dataset on a grid, I now want totunnel through that dataset to find out the environment based onanother set of data, not the data that a sensor on the animaldetected. The animal sensor only gives position/depth.

Neither of these are synoptic nor a single time series from onesensor - and I think it is very important to include these in the usecases because our experience to date is these types of data extractshave not really been in the OGC radar and for which extracts areeither impossible or else extremely complex.

The one exception to this is CSML, which I like for the very reasonsthat the features types align well with both how users of the datathink of the data and how they use the data. And the CDM is good forthe very same reasons, and to beat a dead horse, the two are veryclose. This is the very reason why I have argued to make thecrosswalk between CDM and CSML be the main activity, and then usewhatever services CSML ends up using, rather than to constantly tryand convince a very large body that all the scientists that use thedata perhaps have reasons for thinking about the data they way theydo, rather than the way OGC says we must (and remember I am actuallya research scientist, and I still access and use large quantity ofdata in analyses - these are not theoretical discussions to me, butones that affect my day to day ability to do my work). I believeAndrew Woolf made a similar comment awhile back (it was actuallyworded much more strongly but I will leave it to Andrew to decide ifhe wants to repeat it).


Thanks,

-Roy
On Oct 11, 2008, at 11:01 AM, Ben Domenico wrote:

Hi Roy,
You make very good points. In my effort to keep the use casesbrief, I did not make it clear that the intention was for each oneto represent a particular category or type of data.
So to take your cases, if I understand the animal track environmentexample, I am guessing you are talking about an animal (perhaps adolphin) that is instrumented to monitor some properties of itsenvironment as it travels, . In my list, that would be an oceanequivalent of the trajectory case that's represented by the aircraft-borne observations. My thought is that, if we agree on a set ofconventions for representing such trajectories, we can use it forobservations along dolphin tracks, aircraft tracks, ship tracks,etc. One additional note is that, in all my cases, I emphasize thatwe should be prepared to address collections of such observations aswell as individual ones. So I will find a way to make it clear thatmy proposed use cases are intended to be representative and andcould be used for other cases that involve similar data types.
Regarding the other case you mention of comparing present conditionswith long term trends in a particular area, my idea is that thoseare just different space-time bounding boxes for the datacollections in the region you are interested in. If you are talkingabout using observations from moored buoys in this case, it fitsnicely with my proposed case for obtaining the station obs in theregion around Paris. If you include CTD ocean soundings, it'sequivalent to the Paris use case that includes atmospheric balloonsoundings.
You say you can deal with these cases in the netCDF/OPeNDAP world.My question is, if I define a region of interest in the ocean and aset of time bounds (short or long term) and then ask for all theobservations from instrumented dolphins, what are the CF conventionsthat describe the netCDF that I get back? More specifically, how doI figure out where and when all those observations were taken?
I would ask the same question regarding the station data in theupwelling region case you mention. What are the CF conventions thatprovide the information needed to figure out where and when thestation (or buoy) data points were observed?
Those are exactly the kinds of conventions John Caron is working onand where I think we need some consensus. If we come to agreementon those CF conventions, then we can propose the resulting CF-netCDFas a standard coverage encoding as a means of connecting our workwith the formal standards community.. But, if you think we alreadyhave an explicit way to deal with these cases in the netCDF/OPeNDAPworld, please let me know. Maybe there's a way to short cut themulti-step process I had envisioned.
On the other hand, my next task is to revise my use cases toindicate that, while they are written as specific cases, they areintended to be representative of a family of cases for each of thedata types. Hopefully I can do that without getting too wordy.
-- Ben

Follow-Ups:
- Re: [galeon] The GALEON wiki and Use Cases
  - From: Ben Domenico

References:
- [galeon] The GALEON wiki and Use Cases
  - From: Ben Domenico
- Re: [galeon] The GALEON wiki and Use Cases
  - From: Ben Domenico
- Re: [galeon] The GALEON wiki and Use Cases
  - From: Roy Mendelssohn
- Re: [galeon] The GALEON wiki and Use Cases
  - From: Ben Domenico