<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">AMT</journal-id>
<journal-title-group>
<journal-title>Atmospheric Measurement Techniques</journal-title>
<abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1867-8548</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/amt-6-2851-2013</article-id>
<title-group>
<article-title>Semi-autonomous sounding selection for OCO-2</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Mandrake</surname>
<given-names>L.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Frankenberg</surname>
<given-names>C.</given-names>
<ext-link>https://orcid.org/0000-0002-0546-5857</ext-link>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>O'Dell</surname>
<given-names>C. W.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Osterman</surname>
<given-names>G.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Wennberg</surname>
<given-names>P.</given-names>
<ext-link>https://orcid.org/0000-0002-6126-3854</ext-link>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Wunch</surname>
<given-names>D.</given-names>
<ext-link>https://orcid.org/0000-0002-4924-0377</ext-link>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Colorado State University, Fort Collins, CO, USA</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>California Institute of Technology, Pasadena, CA, USA</addr-line>
</aff>
<pub-date pub-type="epub">
<day>25</day>
<month>10</month>
<year>2013</year>
</pub-date>
<volume>6</volume>
<issue>10</issue>
<fpage>2851</fpage>
<lpage>2864</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2013 L. Mandrake et al.</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 3.0 Unported License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/3.0/">https://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://amt.copernicus.org/articles/6/2851/2013/amt-6-2851-2013.html">This article is available from https://amt.copernicus.org/articles/6/2851/2013/amt-6-2851-2013.html</self-uri>
<self-uri xlink:href="https://amt.copernicus.org/articles/6/2851/2013/amt-6-2851-2013.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/6/2851/2013/amt-6-2851-2013.pdf</self-uri>
<abstract>
<p>Many modern instruments generate more data than may be fully processed in a
timely manner. For some atmospheric sounders, much of the raw data cannot be
processed into meaningful observations due to suboptimal viewing conditions,
such as the presence of clouds. Conventional solutions are quick,
empirical-threshold filters hand-created by domain experts to weed out
unlikely or unreasonable observations, coupled with randomized down sampling
when the data volume is still too high. In this paper, we describe a method
for the construction of a subsampling and ordering solution that maximizes
the likelihood that a requested data subset will be usefully processed. The
method can be used for any metadata-rich source and implicitly discerns
informative vs. non-informative data features while still permitting user
feedback into the final features selected for filter implementation. We
demonstrate the method by creating a selector for the spectra of the
Japanese GOSAT satellite designed to measure column averaged mixing ratios
of greenhouse gases including carbon dioxide (CO&lt;sub&gt;2&lt;/sub&gt;). This is done within
the Atmospheric CO&lt;sub&gt;2&lt;/sub&gt; Measurements from Space (ACOS) NASA project with
the intention of eventual use during the early Orbiting Carbon Observatory-2
(OCO-2) mission. OCO-2 will have a 1.5 orders of magnitude larger data
volume than ACOS, requiring intelligent pre-filtration.</p>
</abstract>
<counts><page-count count="14"/></counts>
</article-meta>
</front>
<body/>
<back>
<ref-list>
<title>References</title>
<ref id="ref1">
<label>1</label><mixed-citation publication-type="other" xlink:type="simple">Boesch, H., Backer, D., Connor, B., Crips, D., and Miller, C.: Global Characterization of CO&lt;sub&gt;2&lt;/sub&gt; Column Retrievals from Shortwave-Infrared Satellite Observations of thet Orbiting Carbon Observatory-2 Mission, Remote Sens., 3, 270–304, 2011.</mixed-citation>
</ref>
<ref id="ref2">
<label>2</label><mixed-citation publication-type="other" xlink:type="simple">Conner, B., Boesch, H., Toon, G., Sen, B., Miller, C., and Crisp, D.: Orbiting Carbon Observatory: Inverse method and prospective error analysis, J. Geophys. Res.-Atmos., 113, D05305, &lt;a href=&quot;http://dx.doi.org/10.1029/2006JD008336&quot;&gt;https://doi.org/10.1029/2006JD008336&lt;/a&gt;, 2008.</mixed-citation>
</ref>
<ref id="ref3">
<label>3</label><mixed-citation publication-type="other" xlink:type="simple">Crisp, D., Atlas, R. M., Breon, F.-M., Brown, L.R., Burrows, J. P., Ciais, P., Connor, B. J., Doney, S. C., Fung, I. Y., Jacob, D. J., Miller, C. E., O&apos;Brien, D., Pawson, S., Randerson, J. T., Rayner, P., Salawitch, R. J., Sander, S. P., Sen, B., Stephens, G. L., Tans, P. P., Toon, G. C., Wennberg, P. O., Wofsy, S. C., Yung, Y. L., Kuang, Z., Chudasama, B., Sprague, G., Weiss, B., Pollock, R., Kenyon, D., and Schroll, S.: The Orbiting Carbon Observatory (OCO) mission, Adv. Space Res., 34, 700–709, 2004.</mixed-citation>
</ref>
<ref id="ref4">
<label>4</label><mixed-citation publication-type="other" xlink:type="simple">Crisp, D., Fisher, B. M., O&apos;Dell, C., Frankenberg, C., Basilio, R., Bösch, H., Brown, L. R., Castano, R., Connor, B., Deutscher, N. M., Eldering, A., Griffith, D., Gunson, M., Kuze, A., Mandrake, L., McDuffie, J., Messerschmidt, J., Miller, C. E., Morino, I., Natraj, V., Notholt, J., O&apos;Brien, D. M., Oyafuso, F., Polonsky, I., Robinson, J., Salawitch, R., Sherlock, V., Smyth, M., Suto, H., Taylor, T. E., Thompson, D. R., Wennberg, P. O., Wunch, D., and Yung, Y. L.: The ACOS CO&lt;sub&gt;2&lt;/sub&gt; retrieval algorithm – Part II: Global X$_CO_2$ data characterization, Atmos. Meas. Tech., 5, 687–707, &lt;a href=&quot;http://dx.doi.org/10.5194/amt-5-687-2012&quot;&gt;https://doi.org/10.5194/amt-5-687-2012&lt;/a&gt;, 2012.</mixed-citation>
</ref>
<ref id="ref5">
<label>5</label><mixed-citation publication-type="other" xlink:type="simple">Frankenberg, C., Platt, U., and Wagner, T.: Iterative maximum a posteriori (IMAP)-DOAS for retrieval of strongly absorbing trace gases: Model studies for CH&lt;sub&gt;4&lt;/sub&gt; and CO&lt;sub&gt;2&lt;/sub&gt; retrieval from near infrared spectra of SCIAMACHY onboard ENVISAT, Atmos. Chem. Phys., 5, 9–22, &lt;a href=&quot;http://dx.doi.org/10.5194/acp-5-9-2005&quot;&gt;https://doi.org/10.5194/acp-5-9-2005&lt;/a&gt;, 2005.</mixed-citation>
</ref>
<ref id="ref6">
<label>6</label><mixed-citation publication-type="other" xlink:type="simple">Guerlet, S., Butz, A., Schepers, D., Basu, S., Hasekamp, O. P., Kuze, A., Yokota, T., Blavier, J.-F., Deutscher, N. M., Griffith, D., W., T., Hase, F., Kyro, E., Mornio, I., Sherlock, V., Sussman, R., Galli, A., and Aben, I.: Impact of aerosol and thin cirrus on retrieving and validating X$_CO_2$ from GOSAT shortwave infrared measurements, J. Geophys. Res.-Atmos., 118, 4887–4905, &lt;a href=&quot;http://dx.doi.org/10.1002/jgrd.50332&quot;&gt;https://doi.org/10.1002/jgrd.50332&lt;/a&gt;, 2013.</mixed-citation>
</ref>
<ref id="ref7">
<label>7</label><mixed-citation publication-type="other" xlink:type="simple">Guyon, I. and Elisseeff, A.: An Introduction to Variable and Feature Selection, J. Machine Learn. Res., 3, 1157–1182, 2003.</mixed-citation>
</ref>
<ref id="ref8">
<label>8</label><mixed-citation publication-type="other" xlink:type="simple">Jin, Y. and Sendhoff, B.: Pareto-based multi-objective machine learning: An overview and case studies, IEEE Trans. Systems, Man, and Cybernetics, Part C: Applic. Rev., 38, 397–415, 2008.</mixed-citation>
</ref>
<ref id="ref9">
<label>9</label><mixed-citation publication-type="other" xlink:type="simple">O&apos;Brien, D. M., Pollock, R., Polonsky, I., and Rogers, M.: Identification and Correction of Residual Image in the O&lt;sub&gt;2&lt;/sub&gt; A-Band of the Orbiting Carbon Observatory, IEEE Trans. Geosci. Remote Sens., 49, 2426–2437, 2011.</mixed-citation>
</ref>
<ref id="ref10">
<label>10</label><mixed-citation publication-type="other" xlink:type="simple">O&apos;Dell, C. W., Connor, B., Bösch, H., O&apos;Brien, D., Frankenberg, C., Castano, R., Christi, M., Eldering, D., Fisher, B., Gunson, M., McDuffie, J., Miller, C. E., Natraj, V., Oyafuso, F., Polonsky, I., Smyth, M., Taylor, T., Toon, G. C., Wennberg, P. O., and Wunch, D.: The ACOS CO&lt;sub&gt;2&lt;/sub&gt; retrieval algorithm – Part 1: Description and validation against synthetic observations, Atmos. Meas. Tech., 5, 99–121, &lt;a href=&quot;http://dx.doi.org/10.5194/amt-5-99-2012&quot;&gt;https://doi.org/10.5194/amt-5-99-2012&lt;/a&gt;, 2012.</mixed-citation>
</ref>
<ref id="ref11">
<label>11</label><mixed-citation publication-type="other" xlink:type="simple">Periaux, J. and Galan, M.: Genetic Algorithms in Engineering and Computer Science, John Wiley &amp; Son Ltd, 1995.</mixed-citation>
</ref>
<ref id="ref12">
<label>12</label><mixed-citation publication-type="other" xlink:type="simple">Reuter, M., Bovensmann, H., Buchwitz, M., Burrows, J. P., Connor, B. J., Deutscher, N. M., Griffith, D. W. T., Heymann, J., Keppel-Aleks, G., Messerschmidt, J., Notholt, J., Petri, C., Robinson, J., Schneising, O., Sherlock, V., Velazco, V., Warneke, T., Wennberg, P. O., and Wunch, D.: Retrieval of atmospheric CO&lt;sub&gt;2&lt;/sub&gt; with enhanced accuracy and precision from SCIA- MACHY: Validation with FTS measurements and comparison with model results, J. Geophys. Res., 116, D04301, &lt;a href=&quot;http://dx.doi.org/10.1029/2010JD015047&quot;&gt;https://doi.org/10.1029/2010JD015047&lt;/a&gt;, 2011.</mixed-citation>
</ref>
<ref id="ref13">
<label>13</label><mixed-citation publication-type="other" xlink:type="simple">Taylor, T. E., O&apos;Dell, C. W., O&apos;Brien, D. M., Kikuchi, N., Yokota, T., Nakajima, T. Y., Ishida, H., Crisp, D., and Nakajima, T.: Comparison of Cloud-Screening Methods Applied to GOSAT Near-Infrared Spectra, IEEE Trans. Geosci. Remote Sens., 50, 295–309, 2012.</mixed-citation>
</ref>
<ref id="ref14">
<label>14</label><mixed-citation publication-type="other" xlink:type="simple">Wunch, D., Wennberg, P. O., Toon, G. C., Connor, B. J., Fisher, B., Osterman, G. B., Frankenberg, C., Mandrake, L., O&apos;Dell, C., Ahonen, P., Biraud, S. C., Castano, R., Cressie, N., Crisp, D., Deutscher, N. M., Eldering, A., Fisher, M. L., Griffith, D. W. T., Gunson, M., Heikkinen, P., Keppel-Aleks, G., Kyrö, E., Lindenmaier, R., Macatangay, R., Mendonca, J., Messerschmidt, J., Miller, C. E., Morino, I., Notholt, J., Oyafuso, F. A., Rettinger, M., Robinson, J., Roehl, C. M., Salawitch, R. J., Sherlock, V., Strong, K., Sussmann, R., Tanaka, T., Thompson, D. R., Uchino, O., Warneke, T., and Wofsy, S. C.: A method for evaluating bias in global measurements of CO&lt;sub&gt;2&lt;/sub&gt; total columns from space, Atmos. Chem. Phys., 11, 12317–12337, &lt;a href=&quot;http://dx.doi.org/10.5194/acp-11-12317-2011&quot;&gt;https://doi.org/10.5194/acp-11-12317-2011&lt;/a&gt;, 2011.</mixed-citation>
</ref>
<ref id="ref15">
<label>15</label><mixed-citation publication-type="other" xlink:type="simple">Yokota, T., Oguma, H., Morino, I., and Inoue, G.: A nadir looking SWIR FTS to monitor CO&lt;sub&gt;2&lt;/sub&gt; column density for Japanese GOSAT project, Proc. Twenty-fourth Int. Sympo. on Space Technol. and Sci., JSASS and Organizing Comm. of the 24th ISTS, 887–889, 2004.</mixed-citation>
</ref>
</ref-list>
</back>
</article>