<?xml version="1.0" encoding="utf-8"?>
<!-- name="GENERATOR" content="github.com/mmarkdown/mmark Mmark Markdown Processor - mmark.miek.nl" -->
<rfc version="3" ipr="trust200902" docName="draft-happel-sml-problem-statement-00" submissionType="IETF" category="info" xml:lang="en" xmlns:xi="http://www.w3.org/2001/XInclude" indexInclude="true" consensus="true">

<front>
<title>Structured Email: Problem Statement and Areas of Work</title><seriesInfo value="draft-happel-sml-problem-statement-00" stream="IETF" status="informational" name="Internet-Draft"></seriesInfo>
<author initials="H.-J." surname="Happel" fullname="Hans-Joerg Happel"><organization>audriga GmbH</organization><address><postal><street></street>
</postal><email>happel@audriga.com</email>
<uri>https://www.audriga.com</uri>
</address></author><author initials="C." surname="Junghans" fullname="Conny Junghans"><organization></organization><address><postal><street></street>
</postal><email>conny.junghans@1und1.de</email>
</address></author><date/>
<area>ART</area>
<workgroup>Network Working Group</workgroup>

<abstract>
<t>This document discusses benefits of complementing existing email standards by means that allow to replace or extend text-based email messages with message parts that describe content (full or in parts) in a machine-readable way. This would enable rich and structured interaction for its recipients - may it be human users or agents on their behalf.</t>
</abstract>

</front>

<middle>

<section anchor="introduction"><name>Introduction</name>
<t>Email is a successful and widespread technology which is likely to remain important, even if new vendors and technologies continuously challenge its role.</t>
<t>Email, to some extent, can be considered a victim of its own success. Interrelations of various components have created a relatively stable software ecosystem, which makes it difficult to introduce larger improvements. While many RFCs have updated and extended initial email specifications, the major inner workings of email remain widely unchanged since their inception.</t>
<t>This documents lines out certain issues with email content that might be addressed by standardization work. The proposed approach aims to enable novel ways of how users and automated programs can interact via email while retaining downwards compatibility with existing email standards. For additional background, see also [I-D.happel-structured-dynamic-email-00].</t>
</section>

<section anchor="problem-statement"><name>Problem statement</name>

<section anchor="email-content-is-not-machine-readable"><name>Email content is not machine-readable</name>
<t>A large share of today's emails is of transactional nature, i.e., sent by automated agents or processes to human users. While those emails often have relatively clear semantics (such as <em>Invoice</em> or <em>Reservation</em>), the medium of those emails is still human-readable text.</t>
<t>In order to help users with managing and processing their emails more efficiently, a machine-readable representation of email content can be an important building block.</t>
</section>

<section anchor="personal_api"><name>Using email as a &quot;personal API&quot;</name>
<t>Beyond improving the handling of existing email interactions, it is worthwhile to note that email can be considered a prime technology in the recent discussion about data sovereignty, which aims to give back control to users over their personal data.</t>
<t>Email is inherently open and decentralized. It is probably the only widely-used and standardized technology which seamlessly connects the local data space of users (i.e., emails on their PCs, mobile devices, or private hosted mailboxes) with the public internet.</t>
<t>It might even be considered a ubiquitous &quot;personal API&quot; of internet users, which to date is mostly based on text-based instructions (&quot;John, could you please send me this presentation?&quot;). Along these lines, structured email might serve as an enabling technology for granting users additional sovereignty in date storage and exchange when used to interact with internet services.</t>
</section>

<section anchor="existing"><name>Slow adoption of existing solutions</name>
<t>In a mature technological space such as email, any novel approach such as structured email needs to address the issue of user adoption.</t>
<t>Existing attempts to structure email interaction can be distinguished in <em>standards-based</em> and <em>vendor-driven</em> approaches. Several RFCs address very particular problems such as meeting workflows <xref target="RFC2447"></xref>, message delivery notifications <xref target="RFC8098"></xref>, mailing list subscriptions <xref target="RFC8058"></xref>, message metadata <xref target="RFC6477"></xref> or message content interpretation <xref target="RFC9078"></xref>. Creating independent RFCs for each possible type of structured email interaction might not be a feasible approach for both standards makers and client developers.</t>
<t>With respect to vendors, Google (<xref target="EMarkup"></xref>, <xref target="AMPemail"></xref>) and Microsoft (<xref target="AM"></xref>) have made attempts to structure email interactions. Even though some other vendors support, e.g., <xref target="EMarkup"></xref>, a closer look shows that those implementations are mostly incompatible among each other (<xref target="SmartInbox"></xref>, <xref target="YahooPS"></xref>, <xref target="ZohoQAES"></xref>). Besides a lack of standardization (and according tool support), widespread adoption also suffers from various sender restrictions, including manual approval processes.</t>
</section>
</section>

<section anchor="areas-of-work"><name>Areas of work</name>
<t>This section tries to identify and structure areas of work to address the aforementioned topics. We distinguish some core work, additional, and optional topics.</t>

<section anchor="core-areas-of-work"><name>Core areas of work</name>

<section anchor="enable-email-clients-to-understand-email-content"><name>Enable email clients to understand email content</name>
<t>As a primary building block for structured mail, there needs to be a specification on how to represent email content or metadata about email content in a machine-readable form.</t>
<t>Such an approach needs to be downwards compatible with existing practices and clients, probably in a similar way the &quot;multipart/alternative&quot; MIME part type is used for HTML email (&quot;text/html&quot;). <xref target="AMPemail"></xref> for instance, is leveraging this approach by adding an additional MIME part in conjunction with &quot;text/plain&quot; and &quot;text/html&quot;.</t>
<t>Regarding content semantics, <xref target="EMarkup"></xref> is using a subset of <xref target="SchemaOrg"></xref> types in a JSON-LD <xref target="W3C.REC-json-ld11-20200716"></xref> representation. Ideally an open approach to structured email would allow for decentralized extensions of content semantics.</t>
</section>

<section anchor="discovery"><name>Enable users to compose structured email</name>
<t>To sustain the decentralized and interactive nature of email, it is insufficient to just allow professional email senders to send structured emails to users. Instead, users need to be enabled to answer and also to initiate structured email exchanges.</t>
<t><xref target="AMPemail"></xref> (initiated by Google) and Microsoft Actionable Messages <xref target="AM"></xref> are approaches that allow users to reply with a structurd answer based on input forms. However, the required input is not machine-understandable, making it difficult to assist users in data entry. Both approaches allow responses to be sent to a dedicated HTTP API only, but not as a regular email response.</t>
<t>Both approaches also do not allow users to initiate a structured email exchange. For this purpose, a discovery mechanism would be required, which allows users resp. their email clients to determine which types of structured email content an intented email recipient is willing to accept besides standard email.</t>
</section>
</section>

<section anchor="cross-cutting-areas-of-work"><name>Cross-cutting areas of work</name>

<section anchor="internet-message-format-extensions"><name>Internet Message Format extensions</name>
<t>While structured email should be designed to maintain downwards-compatibility with the existing email technology stack, there might be some smaller helpful extensions:</t>

<ul spacing="compact">
<li>Additional email header information might inform clients about the presence of structured email content.</li>
<li>It might be helpful to distinguish structured email markup which represents the complete message content (in the sense of &quot;multipart/alternative&quot;) versus markup which provides partial annotations or metadata to regular email content. For the latter case, a standardized option to help clients identify such MIME parts (e.g., based on a MIME part header) might be useful.</li>
<li>A distinct characteristic of email technology is the multitude of processing options on both server-side or on client-side with multiple possible email clients operating on one account. This challenge is, e.g., also present in the email filtering design space <xref target="RFC5228"></xref>/<xref target="RFC5804"></xref>. Accordingly, there might be use cases in which users may want to direct structured email processing to certain clients (e.g., travel information to the mobile email app) in which additional header information might help guide processing.</li>
</ul>
</section>

<section anchor="trust"><name>Trust</name>
<t>(Semi-)automated processing of structured email requires increased scrutiny with respect to security issues. Accordingly, current vendor-specific approaches (<xref target="EMarkup"></xref>/<xref target="AMPemail"></xref>/<xref target="AM"></xref>) require DKIM <xref target="RFC6376"></xref> and/or SPF <xref target="RFC7208"></xref> set up in addition to a manual sender registration process.</t>
<t>While the latter is already a clear inhibitor of adoption, this process will not work if users shall be enabled to send structured mail by themselves - a scenario without a central gatekeeper. Accordingly, suitable measures of trust need to be in place. Candidate approaches might be a consolidation of existing email trust indicators, specific concepts of &quot;trusted senders&quot;, or drawing inspiration from sources such as the ACME protocol <xref target="RFC8555"></xref>.</t>
</section>

<section anchor="backend_apis"><name>Interaction with HTTP backend APIs</name>
<t>Structured email may benefit from certain server-side (HTTP) APIs, serving different purposes. An API for the discovery if an intended receiver supports structured email has already been discussed (see <xref target="discovery"></xref>).</t>
<t>Additional APIs could be useful to allow synchronous responses of structured data in certain uses cases in which an asynchronous message response cycle would be infeasible. All current vendor-specific approaches (<xref target="EMarkup"></xref>/<xref target="AMPemail"></xref>/<xref target="AM"></xref>) allow for some sort of direct data transmission against an HTTP backend API.</t>
</section>
</section>

<section anchor="optional-areas-of-work"><name>Optional areas of work</name>

<section anchor="email-search-and-filtering"><name>Email search and filtering</name>
<t>Machine-understandable interpretation of email content could also be used to improve some existing practices based on text/keyword-based processing of email content. In particular, the IMAP SEARCH command (<xref target="RFC9051" sectionFormat="of" relative="#" section="6.4.4"></xref>) and email filtering <xref target="RFC5228"></xref> might benefit from corresponding extensions. However, those might also be subject to work of other IETF working groups.</t>
</section>

<section anchor="external_apis"><name>Interaction with other data types and external APIs</name>
<t>Email clients have grown over years to support various types of user data beyond email. This typically involves contacts/address books, calendars, tasks, notes, or files.</t>
<t>Besides the case of iMIP <xref target="RFC2447"></xref>, interrelation between email and other data types is vendor-specific, if present at all. Structured email could help to enable easier interaction within email clients, with other applications on the same machine and with services on the internet (see also <xref target="personal_api"></xref>).</t>
<t>Ongoing work in the JMAP working group <xref target="RFC8620"></xref> could overlap with some of these aspects. The W3C Solid initiative <xref target="Solid"></xref> is another loosely related</t>
</section>

<section anchor="email-storage-formats"><name>Email storage formats</name>
<t>Several vendors use machine learning or data extraction approaches (see also <xref target="data_extraction"></xref>) to derive information about email content which is similar to or complementary to structured email content.</t>
<t>For interoperability and data portability, it would be helpful to standardize storage for such metadata. This might also help with current portability issues stemming from user-defined metadata such as IMAP flags (<xref target="RFC9051" sectionFormat="of" relative="#" section="2.3.2"></xref>).</t>
</section>

<section anchor="common-practices-in-email-processing"><name>Common practices in email processing</name>
<t>While structured email aims to provide a user-driven approach to make email content machine-understandable, there may be a number of more technical issues related to email-processing which might be addressed en route.</t>
<t>Besides consideration of already existing RFCs (see <xref target="existing"></xref>), this may involve currently unstandardized issues such as email signatures, vacation notices or &quot;noreply&quot; addresses. Those are probably more specific application areas of structured email, since they can impact multiple layers of email processing and corresponding standards.</t>
</section>
</section>

<section anchor="out-of-scope-areas-of-work"><name>Out of scope areas of work</name>
<t>This section lists aspects which are relevant to structured mail but which might not be addressed by formal standardization work in the scope of this proposal.</t>

<section anchor="data_extraction"><name>Data extraction</name>
<t>Several service providers or tools (e.g., <xref target="KDEItinerary"></xref>) apply data extraction techniques to derive structured data from textual email content. Data extraction is therefore an interesting approach to help bootstrapping the adoption of structured email, until it has become common practice among email senders.</t>
<t>While cooperation on shared data extractors might be a useful activity for parties interested in structured email, it is probably not a subject for standardization.</t>
</section>

<section anchor="ui-design"><name>UI design</name>
<t>Prescription of how user interfaces should display structured email or organize interaction with it are - besides illustration - typically out of scope of RFCs. As in the case of data extraction, there might however be a shared interest among vendors to collaborate on certain best practices.</t>
</section>
</section>
</section>

<section anchor="privacy-considerations"><name>Privacy considerations</name>
<t>Since email content often contains personal data, it is subject to privacy considerations. From a high level perspective, privacy issues in structured email should not significantly differ to privacy issues in existing email standards.</t>
<t>On a more fine granuar level, structured email might both raise certain novel privacy issues (i.e., since structured data is more easy to process and share), but it could also improve certain privacy concerns. For instance, it could make certain data extraction practives (see <xref target="data_extraction"></xref>) obsolete, which currently cannot distingish senstive from non-sentitive parts of an email.</t>
</section>

<section anchor="security"><name>Security considerations</name>
<t>As a problem statement document, no particular security considerations apply as such. Similar to privacy considerations, security issues might also widely overlap with those of existing email standards.</t>
<t>See <xref target="trust"></xref> for the discussion of some general security aspects with respect to structured email.</t>
</section>

<section anchor="iana-considerations"><name>IANA Considerations</name>
<t>This document has no IANA actions at this time.</t>
</section>

</middle>

<back>
<references><name>Informative References</name>
<reference anchor="AM" target="https://learn.microsoft.com/en-us/outlook/actionable-messages/">
  <front>
    <title>Actionable Messages</title>
    <author>
      <organization>Microsoft Inc.</organization>
    </author>
    <date></date>
  </front>
</reference>
<reference anchor="AMPemail" target="https://amp.dev/about/email/">
  <front>
    <title>AMP email</title>
    <author>
      <organization>OpenJS Foundation</organization>
    </author>
    <date></date>
  </front>
</reference>
<reference anchor="EMarkup" target="https://developers.google.com/gmail/markup">
  <front>
    <title>Email Markup</title>
    <author>
      <organization>Google Inc.</organization>
    </author>
    <date></date>
  </front>
</reference>
<reference anchor="KDEItinerary" target="https://apps.kde.org/itinerary/">
  <front>
    <title>KDE Itinerary</title>
    <author>
      <organization>KDE e.V.</organization>
    </author>
    <date></date>
  </front>
</reference>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.2447.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5228.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5804.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.6376.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.6477.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.7208.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.8058.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.8098.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.8555.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.8620.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.9051.xml"/>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.9078.xml"/>
<reference anchor="SchemaOrg" target="https://schema.org/">
  <front>
    <title>Schema.org</title>
    <author>
      <organization>W3C Schema.org Community Group</organization>
    </author>
    <date></date>
  </front>
</reference>
<reference anchor="SmartInbox" target="https://postmaster.web.de/en/smartinbox">
  <front>
    <title>WEB.DE Smart Mailbox</title>
    <author>
      <organization>1&amp;1 Mail &amp; Media GmbH</organization>
    </author>
    <date></date>
  </front>
</reference>
<reference anchor="Solid" target="https://solidproject.org/TR/protocol">
  <front>
    <title>Solid Protocol</title>
    <author>
      <organization>W3C Solid Community Group</organization>
    </author>
    <date></date>
  </front>
</reference>
<xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml-w3c/reference.W3C.REC-json-ld11-20200716.xml"/>
<reference anchor="YahooPS" target="https://senders.yahooinc.com/promotions-and-schema/">
  <front>
    <title>Promotions &amp; Schema</title>
    <author>
      <organization>Yahoo Inc.</organization>
    </author>
    <date></date>
  </front>
</reference>
<reference anchor="ZohoQAES" target="https://www.zoho.com/mail/help/quick-actions.html">
  <front>
    <title>Quick Actions and Email Snippets</title>
    <author>
      <organization>Zoho Corporation Pvt. Ltd.</organization>
    </author>
    <date></date>
  </front>
</reference>
</references>

</back>

</rfc>
