Security and Privacy

A Security Model for Web-Based Communication

In this paper, we introduce a generic security model for Web services based on the dimensions of resolution, transaction, and identification.

By Pouyan Fotouhi Tehrani, Eric Osterweil, Thomas C. Schmidt, and Matthias Wählisch

Posted Aug 28 2024

Introduction
A Security Model for Web-Based Communication
DNS Namespace Analysis
Web PKI Analysis
How to Proceed Further?
Conclusion and Outlook
Acknowledgments
References

Read the related Technical Perspective

Abstract

Web access involves various protocols to resolve domain names to IP addresses, establish data exchange channels with Web servers, and to authenticate communication partners. Each protocol has its own set of requirements and security measures. In addition to technical features, operating the Web also introduces organizational and political aspects which are important to consider when deploying a secure basis for Web-based communication. In this paper, we propose an algorithmic security model based on the widely deployed technologies DNSSEC and Web PKI to cover the three dimensions identification, resolution, and transaction. Our model enables quantification and qualification of the security assurance provided by an online service provider. To verify the applicability of our model, we investigate the online presence of Alerting Authorities in the U.S., selected German Emergency Service providers, and UN member states. We observe partially enhanced security relative to global Internet trends, yet find cause for concern as only about 6% of unique hosts cater to secure resolution. About 46% of investigated organizations use shared certificates with 1% of all organizations having no or invalid certificates. Two thirds of organizations are not uniquely identifiable and as such lack the basic requirement of trustworthy communication.

1 Introduction

A fundamental base for leveraging Web content is trustworthy communication with the remote content endpoint. Assessing the level of provided security is challenging, though, due to the complexity of the current Web, which involves multiple protocols and stakeholders when accessing a Web page as depicted in Figure 1. To demonstrate the communication workflow and respective security pitfalls, we consider the simple case of inquiring information about COVID-19 guidance as a resident of Jackson County in Missouri. Through a search engine, an online ad, a recommendation from friends, among others, the URL is quickly discovered: https://jacohd.org. When visiting the Website, the presence of a green padlock in the address bar indicates only the confidentiality and integrity of data exchange, but does not indicate whether the Website belongs to the supposed service (that is, Health Department of Jackson County in Missouri instead of one of the other 22 Jackson Counties). The generic domain name could have been registered and operated by anyone. An attacker could have published a forged Website implementing the look and feel of the real health department.

At no stage does the user receive a chance to authenticate the identity (that is, identification) of the service provider because the supplied domain validation (DV) certificate does not include any identification information. In contrast, the health department of Jackson County in Michigan, reachable under www.co.jackson.mi.us, is under a restricted top-level domain (TLD), that is, registration required an eligible entity, which indicates that it belongs to Jackson County (co.jackson) in Michigan (mi.us). It is accompanied by an extended validation (EV) certificate, which serves as a definitive proof of identity.

In this paper, we introduce a generic security model for Web services based on three dimensions of resolution, transaction, and identification. Using this model, we provide a framework to understand whether and how specific integration of a service provider in the domain namespace and its use of DNSSEC and X.509 certificates impact secure communication (see Figure 1). Our model allows us to calculate an Assurance Score for a given service provider and qualify its level of security as an inadequate, a weak, or a strong Assurance Profile. We apply our model in practice by systematically investigating the Web representations of organizations active in emergency management or diplomatic relations. The starting points for our study are three datasets: (i) the list of Alerting Authorities (AA) in the U.S. (US-AA), (ii) list of fire departments, police, municipalities, and Red Cross chapters in 200 most populous cities in Germany (DE-EM), and (iii) list of Websites of Ministries of Foreign Affairs (MFA) used to create Diplomatic Pulse, a search engine for official press releases of UN Member States.²³ We analyze domain names and certificates of all these Websites.

This paper complements our previous study¹² by contributing a solution for general Web communication, instead of limiting the analysis to Alerting Authorities. Our key finding is that none of the investigated Web services provide strong security measures, but 95%-99% exhibit an inadequate Assurance Profile, depending on the type of service. This decomposes as follows: (i) only about 6% of observed hosts provide secure resolution and are susceptible to name spoofing and certificate misissuance, (ii) transaction security is dominated by domain validation certificates and about 1% of all investigated services are provided by servers delivering none or invalid certificates, none reaching the secure transaction level, and, finally, (iii) two thirds of investigated organizations cannot be uniquely identified due to shared names or certificates, or just by using domain validation certificates that provide no identity information.

2 A Security Model for Web-Based Communication

We define a security model for web-based communication based on three dimensions: (i) securely verifying that users have not been misdirected (“resolution” of name to network service), (ii) ensuring that the content was not altered, privacy was not leaked, etc. during the session (“transaction” security), and (iii) securely authenticating the authoritative service (“identification” of the person, organization etc. behind the service name). Although different methods can be utilized to implement such a secure workflow, here we focus on those technologies that are most accessible to (and deployable by) users and service operators in the current Internet, namely the DNS and the Web PKI.

Almost all transactions on the Web start with looking up a domain name, for example, www.example.com, and resolving it to an IP address, for example, 93.184.216.34. The domain name system (DNS), the hierarchical distributed database that enables such domain name resolution, is susceptible, though, to a number of vulnerabilities which can end with domain hijacking and impersonation. To counter, a set of security extensions dubbed as DNSSEC have been introduced that secure name resolutions.

After retrieving an IP address, a communication channel is established. The Transport Layer Security (TLS) protocol is the de facto standard to secure this channel. TLS was designed to provide integrity, that is, modified data in transit can be detected at both ends, and confidentiality, that is, no party other than the communicating endpoints can comprehend the exchanged data. To achieve any of these two security goals, authentication of the contacted endpoint is required. Without proper authentication, a malicious party can intercept or impersonate any endpoint while also providing integrity and confidentiality for this impersonated endpoint or proxy.

Authentication can succeed by pre-sharing keys in controlled environments or through X.509 certificates in open environments like the Web. A certificate binds a cryptographic (public) key to a subject which compromises a set of attributes such as organization, country, common name and can be further elaborated by Subject Alternative Names (SAN) like domain names and IP addresses. Its authenticity is vouched for by a trusted third party called a Certification Authority (CA). The most basic form of a subject is a domain name. A certificate holder can prove the ownership of that name based on a cryptographic signature. This is referred to as domain validation or DV for short. A subject can also map to a natural or legal person in the real world. In such cases, we speak of identification as an extension of authentication. Organization and Extended Validation (OV and EV) certificates fall under this category. In a secure setting, it would be possible to identify and authenticate the communication partner before initiating the transaction. Yet, the starting point for Web communications are domain names, which cannot be used for secure identification, while Web PKI certificates (as proofs of identity) are provided only after resolution succeeds and a transaction is initiated. We argue that domain names that are under restricted namespaces, for example, .gov for U.S. governmental organizations, are a necessary (yet not sufficient) precursor for identification.

Authentication only holds if the certificate itself is authentic, for example, not misissued. Within the Web PKI ecosystem, any trusted CA can issue a certificate for any arbitrary name. A rogue or a compromised CA can, thus, be tricked into issuing a valid certificate to an entity that does not match the certificate subject. To counter this threat, the DNS-based authentication of Named Entities (DANE) was introduced, which extends the DNSSEC trust to the Web PKI by signaling clients what (type of) certificate to expect from the server. This is an additional security measure to prevent adversaries from presenting bogus certificates to clients in case of server or CA compromises even if name resolution succeeds securely.

In our security model, we consider resolution to be secure if it is backed by DNSSEC. We consider a secure transaction to be reached if a valid, dedicated certificate matching DANE records is provided. If DANE is missing or the certificate is shared between distinct service providers, we classify it as a weak transaction (see Figure 2). Regarding identification, we consider the highest assurance level to be reached if a service endpoint is under a restricted namespace, provides an OV or EV certificate, and caters to secure resolution (see Figure 3).

Classifying transactions using certificate, name, and handover characteristics (✗ Insecure, ! Partial, ✓ Secure) — Figure 2. Classifying transactions using certificate, name, and handover characteristics ( $✗$ Insecure, ! Partial, $✓$ Secure)

Classifying identification based on name, namespace, certificate, and trust chain characteristics (✗ Insecure, ! Partial, ✓ Secure) — Figure 3. Classifying identification based on name, namespace, certificate, and trust chain characteristics ( $✗$ Insecure, ! Partial, $✓$ Secure)

Finally, to provide a comprehensive view on all three security dimensions, we introduce Assurance Scores and Assurance Profiles. An Assurance Score defines a measure that combines the contributions of each security dimension to web-based communication. Each Score is then mapped to a Security Profile, which groups multiple deployment options into strong, weak, or inadequate. In detail, we first codify each possible label of insecure, partial, and secure as 0, 1, and 2, respectively. Since resolution can either be secure or insecure it is codified with 0 or 2. Then, we calculate the Assurance Score as $r e s o l u t i o n + t r a n s a c t i o n + 2 \times I d e n t i f i c a t i o n$ . We add extra weight to identification because this dimension is integral in secure communication—even if resolution and transaction are both secure, there is still a need to at least partially identify the communication partner. This weighting represents an example that illustrates our basic contribution. Our general contribution would allow users to employ different weightings as well.

3 DNS Namespace Analysis

In this section, we aim to answer the following questions by studying domain names:

Does each service provider have its own dedicated domain name?
How do service providers integrate in the global DNS namespace?
Do service providers secure their names using DNSSEC?

The first question is concerned with how an organization maintains its online presence without introducing unnecessary dependencies. Lack of a dedicated name, for example, leads to dependence on someone else for authentication and data security as X.509 certificates are bound to domain names. The second question aims to investigate the preference regarding TLDs to take advantage of recognizability (for example, governmental organization under .gov) and security (restricted vs. non-restricted TLDs). The last question considers measures taken into account to secure names against threats such as spoofing or DNS hijacking, which can also lead to impersonation and phishing. Table 1 summarizes our findings.

Table 1.

Top-level domains in use by organizations studied in this work.

Type	TLD		Registration		Registry		Domain share (%)			Domain count (#)			DNSSEC support (#)
Type	Label	DNSSEC	Restricted	Fee/year	Name	Country	`US-AA`	`DE-EM`	`MFA`	`US-AA`	`DE-EM`	`MFA`	`US-AA`	`DE-EM`	`MFA`
gTLD	.com	$✓$	$✗$	< 15 $	Verisign	U.S.	18.23%	0.25%	0%	325	2	0	12	0	–
	`.info`	$✓$	$✗$	< 15 $	Afilias¹	U.S.	0.05%	0%	0%	1	0	0	0	–	–
	`.net`	$✓$	$✗$	< 15 $	Verisign	U.S.	4.53%	0.25%	0%	81	2	0	2	0	–
	`.org`	$✓$	$✗$	< 15 $	PIR	U.S.	24.94%	0.62%	1.18%	446	5	2	31	0	1
							47.76%	1.12%	1.18%	854	9	2	45	0	1
ccTLD	`.<ISO-3166>`	( $✓$ )^†	( $✗$ )	–	–	–	4.19%	88.97%	28.99%	75	710	49	0	44	7
ccSLD	`.<state>.us`	$✓$	$✓$	–	Neustar	U.S.	13.98%	0%	0%	250	0	0	5	–	–
	`.*.<ISO-3166>`	( $✓$ )	$✓$	–	–	–	0%	0%	69.23%	0	0	117	–	–	8
							13.98%	0%	69.23%	250	0	117	5	–	8
sTLD	`.edu`	$✓$	$✓$	77 $	Educase²	U.S.	0.50%	0%	0%	9	0	0	–	–	0
	`.gov`	$✓$	$✓$	^‡ 0 $	CISA³	U.S.	32.32%	0%	0.59%	578	0	1	69	–	0
	`.mil`	$✓$	$✓$	–	DoD NIC	U.S.	1.23%	0%	0%	22	0	0	14	–	–
							24.06%	0%	0.59%	609	0	1	83	–	0
geoTLD	`.eu`	$✓$	( $✗$ )	< 10 $	EURid	BE	0%	0.25%	0%	0	2	0	–	0	–
	`.hamburg`	$✓$	$✗$	≈ 50 $	Hamburg Top-Level-Domain GmbH	DE	0%	0.12%	0%	0	1	0	–	0	–
	`.nrw`	$✓$	$✗$	≈ 30 $	mmx.co⁴	U.S.	0%	9.52%	0%	0	76	0	–	0	–
							0%	9.89%	0%	0	79	0	–	0	–
									Total	1788	798	169	133	44	16

1 Acquired by Donuts Inc. 2 subsidiary of Verisign, 3 Previously managed by General Services Management, 4 Minds + Machines GmbH operated by GoDaddy † See ICANN TLD DNSSEC support [15] ‡ Fees were dropped from 400 $ after takeover by U.S. CISA in 2021

Datasets: U.S. Alerting Authorities (US-AA) / German emergency management (DE-EM) / Ministries of foreign affairs (MFA)

3.1 Dedicated Domain Names

We consider an entity to have a dedicated DNS name either if it has its own effective SLD, or has been assigned a sub-domain under the namespace of its parent organization or any generic service provider that is not shared. For example, the Tehama County Sheriff (tehamaso.org) has its own dedicated name whereas Apache County Sheriffs Office (www.co.apache.az.us/sheriff/) does not.

Out of the total 1,788 collected URLs for the US-AA dataset, 1,747 unique domain names exist, showing that in some cases multiple entities are subsumed under the same domain, for example, different agencies all under the domain name of a single state. For the DE-EM dataset, there exist 595 unique domain names for 798 collected URLs. All 169 collected URLs for the MFA dataset are under distinct domain names.

The share of dedicated names varies largely among different fields of operation. Ministries of Foreign affairs have their own dedicated name. Among the U.S. Alerting Authorities all educational entities (total of 6) and over 90% of governmental entities (628 out of 673) have dedicated names. In contrast, $\approx 23 %$ of public safety entities (197 out of 852), and less than 30% of law enforcement (68 out of 216) and military organizations (6 out of 19) have their own names. In the case of German Emergency providers, 97% of the municipality portals (198 out of 200) and German Red Cross chapters (194 out of 199) are served by dedicated names, followed by 59% of fire departments (118 out of 200), and 23% of police departments (46 out of 199).

3.2 Namespace Structure

To analyse namespace structures, we group various TLDs and country code second-level domains (ccSLDs) as follows:

gTLD: generic top-level domain (for example, .org).
ccTLD: country code top-level domain (for example, .us).
ccSLD: country code second-level domain (for example, gouv.fr).
sTLD: sponsored top-level domain (for example, .mil).
geoTLD: top-level domains for “geographical, geopolitical, ethnic, social or cultural representation” (for example, .eu).

Each TLD group features different properties. In general, there are little to no delegation limits and naming conventions for names under gTLDs or ccTLDs, except for the .us namespace. Under the .us ccTLD more than 3,000 names are reserved and unavailable for public registration. This DNS sub-tree has a rigorous structure on the second, third, and fourth level. The structuring reflects the “political geography” (see RFC1480) and defines a number of reserved names for designated organizations or purposes and territory of operation (for example, county or city). Sponsored TLDs (.edu, .gov, and .mil) impose stricter eligibility requirements and thus have an advantage over gTLD names as only eligible registrants are granted domain name ownership in these namespaces. The sTLDs, however, are mainly limited to the U.S. so that non-U.S. entities are forced to make use of local counterparts under their respective country code SLDs, for example, .ac.uk for academic institutes in the UK or gouv.fr for the French government.

The majority of US-AA entries are located under gTLD namespaces followed by sTLD, ccSLD, and ccTLD namespaces (see Table 1). It is noteworthy that the .us locality namespace (part of .us ccSLD) exhibits a relatively low penetration among AAs. When considering the canonical forms [ci,co].<locality>.<state-code>.us for cities or counties, we observe only 9 out of 66 actual cities that use this naming pattern. Similarly, for names that include the term county, there are only 218 of 696 cases using the canonical form.

In contrast to Alerting Authorities, DE-EM and MFA organizations face more limitations in choice of appropriate and restricted namespaces. For instance, none of the entries in DE-EM are under restricted namespaces ( $\approx$ 90% under .de), and the majority of MFAs resolve to ccSLD namespaces of their countries (for example, .gov.af).

Finally, we examined whether the specific choice of top-level domains of an organization correlates with its field of operation. Figure 4 depicts how widely used various TLD types are in different fields of operation. The majority of Alerting Authorities opt for gTLDs with the exception of educational and military organizations that exclusively take advantage of restricted TLDs (.edu and .mil respectively). DE-EM and MFA organizations primarily choose ccTLD namespaces.

Figure 4. Distribution of TLD types per operation territory.

3.3 DNSSEC Deployment

All investigated TLDs, except a number of ccTLD and ccSLD namespaces, are DNSSEC-enabled (see Table 1). Nonetheless, we only observe 7.4%, 5.5%, and 9.4% of entities that are part of DNSSEC-enabled zones in US-AA, DE-EM, and MFA datasets. It is noteworthy that 77.6%, 70.6%, and 35.5% of entities in respective lists are directly under DNSSEC-enabled zones and only need to activate DNSSEC on their own authoritative nameservers, while 13.1%, 1%, and 53.2% are respectively 2 or more zones deeper in the DNS hierarchy and need at least one predecessor zone to enable DNSSEC before they can. In total, only $\approx 6 %$ of unique hosts that we analyzed are signed and secure, which is still a higher DNSSEC penetration compared to the longitudinal study by Chung et al.,⁶ who measured 0.6% for .com and $1.0 %$ for .org domains.

Only about 12% (65 out of 557) of unique .gov names support DNSSEC. This is surprising for two reasons. First, .gov names are mandated to implement DNSSEC. Second, the DNSSEC penetration is much lower than the $\approx 90 %$ DNSSEC penetration among selected governmental organizations (sample set of approximately 1200 .gov SLDs) as measured by NIST.¹⁷

4 Web PKI Analysis

The ecosystem of Web PKI revolves around X.509 certificates. We investigate the deployment and characteristics of certificates to answer the following questions:

To what extent do entities embrace the Web PKI?
How and for what purposes (identification vs encryption) are certificates utilized?
How is the historic landscape of X.509 shaped among organizations in our dataset?

4.1 Current Deployment of Web PKI

We first want to gain a better understanding of the current deployment of Web certificates by analysing a snapshot of TLS deployment across public servers from our datasets.

The highest TLS deployment is among German emergency services ( $\approx 99 %$ ), also with the lowest number of invalid certificates ( $< 2 %$ ). Comparably, $\approx$ 98% of U.S. Alerting Authorities support TLS with only 2% erroneous certificates. Among Ministries of Foreign Affairs, 15% lack TLS support and nearly 15% of the certificates are unverifiable. Failed verifications are caused by server misconfigurations, self-signed or expired certificates. For validation, we use OpenSSL trusted root certificates (on Ubuntu Linux). In contrast to other Web PKI studies, we see less invalid certificates on average in our sample; Chung et al.⁵ observed $65 %$ on average considering the global IPv4 space in 2016, and Durumeric et al.⁹ found $\approx 13 %$ measuring Alexa 1M top domains in 2013.

4.2 Certificate Usage

Although the technical basis of all X.509 certificates is the same, the semantics of a certificate are defined by the governing validation policies and how a certificate is in use. A Web PKI X.509 certificate must at least fulfill domain validation, that is, authenticate the ownership of a domain name. For OV and EV certificates, the identity of the certificate holder (to various degrees) can be authenticated as well. Identification assurances, however, can be diluted through certificate sharing. Except EV certificates, both DV and OV certificates allow wildcard names as subject alternative names (SAN) to avoid enumerating all FQDNs under the control of the certificate holder. In practice, the SAN extension also allows sharing a certificate among different hosts. For example, in 2019, the U.S. federal government issued OV certificates with more than 600 specific and wildcard SAN entries. Certificate sharing, on the one hand, defeats the purpose of identification and, on the other hand, expands the attack surface and increases operational costs so that if one of the hosts is compromised or the certificate is revoked, every other host necessarily also needs to be configured with a new certificate (sometimes called “fate-sharing”).

To detect certificate sharing, we count the number of unique domain names in the SAN section of a certificate while ignoring www.* subdomains. We consider any SAN count greater than 1 or the existence of wildcard names as an indication of a shared certificate. In the case of EV certificates and regardless of the number of SANs, the certificate is considered dedicated as the certification authority is mandated to ensure that all alternative names are representations of one and the same entity. This is not necessarily the case for OV certificates as we discuss below.

For those organizations behind an SSL/TLS enabled server, we observe that only a small fraction is represented by an EV certificate, $\approx 2 %$ for US-AA, 1% for DE-EM, and $\approx 3 %$ for MFA entries. As we discuss later in Section 5, this is likely due to a decision by major browser vendors to abandon visual cues for different validation types altogether.¹¹ OV certificates amount to one third for entries in DE-EM and MFA datasets each, and 22% for US-AA; 84%, 82%, and 90%, respectively, using shared certificates. The remaining majorities are represented by DV certificates with a portion of 40% (US-AA), 41% (DE-EM), and 57% (MFA).

We observe common deployment models that make use of shared certificates. Most prominently, Cloudflare uses its own CA to issue an OV certificate to itself, that is, ‘Cloudflare, Inc.’ as organization name and sni.cloudflaressl.com as common name, and includes a customer domain name as SAN. In this model, despite the organization validation, the certificate cannot be used to identify the service provider, for example, Wilco County in Texas under www.wilco.org provides a certificate that does not relate to its infrastructure provider Cloudflare. Another model, observed in the US-AA dataset, is when a certificate is shared among different customers of a multitenancy cloud provider (dubbed as “cruise-liner certificates” by Cangialosi et al.³). For instance, Granicus, a cloud provider for governmental organizations in the U.S., uses Let’s Encrypt DV certificates to create a single certificate for many customers at once. Here, not only is identification ambiguous but also confidentiality is jeopardized by shared server keys, leaving customers insufficiently isolated from each other. It is worth noting that Let’s Encrypt certificates only allow up to 100 DNS type SANs. The disadvantage of such an approach compared to the former model is that a single compromised or revoked certificate affects multiple unrelated customers, often called “fate sharing.” Beside cloud service providers, individual organizations also commonly make use of shared certificates to have a single certificate to subdomains that are all managed by a single technical entity.

Table 2 combines our findings from this Section and Section 3 to reveal different combinations of DNS and X.509 certificate characteristics, linked to different levels of assurance. In 3, we group our results by organization types. Due to low DNSSEC deployment, lack of TLSA records, popularity of open TLDs, and pervasiveness of DV certificates, the majority of hosts fall under the inadequate Assurance Profile.

Table 2.

Proposed security model and measurement results, which exhibit specific security features per dataset.

	Security Model					Measurement Results
	Security Dimensions			Assurance		Unique Hosts [#]
#	Resolution	Transaction	Identification	Score	Profile	`US-AA`	`DE-EM`	`MFA`
1	$✗$	$✗$	$✗$	0		93	27	11
2	$✗$	$✗$	!	2		68	0	27
3	$✗$	!	$✗$	1		781	709	30
4	$✗$	!	!	3		713	18	85
5	$✓$	$✗$	$✗$	2		2	2	1
6	$✓$	$✗$	!	4		1	0	4
7	$✓$	!	$✗$	3		43	40	6
8	$✓$	$✓$	$✗$	4		0	0	0
					Total inadequate:	1701 (95%)	796 (99%)	164 (98%)
9	$✓$	!	!	5		87	2	5
10	$✓$	$✓$	!	6		0	0	0
					Total weak:	87 (5%)	2 (1%)	5 (2%)
11	$✓$	$✓$	$✓$	8		0	0	0

Measurable security dimensions (✗/0 : insecure, !/1 : partial, ✓/2 : secure) determine Assurance Scores (resolution + transaction + 2 × Identification) aggregated into Assurance Profiles (Image strong, Image weak, Image inadequate).

Table 3.

Number of unique hosts per Assurance Profile grouped by field of operation.

Dataset	Field of operation	Assurance Profile
Dataset	Field of operation
`US-AA`	Educational	6	0	0
	Governmental	646	28	0
	Law Enforcement	206	10	0
	Military	7	12	0
	Public Safety	819	33	0
	Other	18	4	0
`MFA`	Ministry of Foreign Affairs	164	5	0
`DE-EM`	Fire Department	200	0	0
	Municipality	200	0	0
	Police	197	2	0
	Red Cross	199	0	0

Image strong, Image weak, Image inadequate (see Table 2)

4.3 Historic X.509 Certificate Landscape

The historic analysis of X.509 certificates collected from Certificate Transparency logs helps us to gain a better understanding of security policy changes related to examined entities and CAs. We span eleven years. It should be noted that the total number of organizations providing publicly logged certificates changes for each year. We consider this in the following by normalizing the results either with respect to the number of unique hosts or the total number of certificates valid per year.

Certification Authorities. In addition to common regulations, certification authorities implement and follow their own set of policies. From the perspective of relying parties, that is, Web users, such policies are opaque and as long as a CA is included in a user’s trust store, it is considered trustworthy. For the subscribers, however, these policies among other factors such as fees, offered certificate types, and operation costs are decisive in choosing an appropriate CA.

We focus our analysis on certification authorities with a market share of at least 20% in any year during the past decade. The calculation of market share is based on the number of covered subscribers. We use the term cover to differentiate from issuance: if a subscriber, for example, received a certificate from a CA valid from 2014 to 2016, we consider it to be covered by that CA for 2014, 2015, and 2016. Further, if a CA issues multiple short-lived certificates (for example, 90 days) for a subscriber within a given year, we only count that host as covered once in that year by the issuing CA.

Figure 5 depicts our findings in terms of relative market share development in the past decade. Compared with the CA market share for the Alexa 1M top domain list throughout the past decade¹^,⁹^,¹⁴ we observe parallels, such as decline of the GoDaddy market share and rapid gains of Let’s Encrypt, as well as discrepancies that cannot directly be explained due to dynamic nature of and fluctuations in the Alexa top list. This figure also highlights two factors that were evidently decisive for the organizations in our datasets in their choice of CA: convenience and cost. GoDaddy (among US-AA) and T-Systems (among DE-EM), for example, which have been the market leaders at some points in time, provide Web hosting and domain name registration in addition to certification services in convenient packages. In contrast, Let’s Encrypt, which has surged to the top in the short period after its public offering, offers automated DV certification at no cost.

Figure 5. CA Market share per year grouped by dataset.

Validation types. In Section 2 we discussed the role of OV/EV certificates for secure identification. Although the current deployment of investigated entities (see subsection 4.1) reveals a low penetration of such certificates, historically a higher share of entities made use of OV/EV certificates as depicted Figure 6. The surging popularity of free and automated DV certificates (see Figure 5) has evidently led to a decrease in usage of OV/EV certificates.

Figure 6. Development of certificate types in the past decade

Certificate Sharing. Multitenant Web hosting and security service providers make use of shared certificates as discussed above. Figure 7 depicts the development of certificate sharing based on our analysis of historic data. Here, we also notice an increasing number of certificates shared across hosts which do not belong to the same logical entity. Most critically, also among OV certificates, we see service providers obtainngs certificates under their names and listing the hosts of their customers as SANs. This practically defeats the identification purpose of the certificates. At the time of writing, we observe cases of such certificates listing SANs that clearly belong to separate entities, for example, mo.gov, asap.farm, and incapsula.com within the same certificate. In this very specific case, records from the Wayback Machine archives show that asap.farm has previously belonged to Missouri Department of Agriculture but it was never removed from the certificate even though the domain name registration was transferred to another entity.

Figure 7. Share of host names represented by certificates with more than 2 unique SAN entries (excluding EV certificates)

5 How to Proceed Further?

Our results draw a rather alarming picture of how security measures of Web-based communication are implemented and integrated. We now discuss some possible reasons for the observed deficiencies and suggest alternatives.

Possible reasons for lack of security. Trustworthy Web communication is complex because it involves consumers as well as infrastructure operators such as CAs, ISPs, and browser vendors. CA practices and guidelines diverge,⁸ some automated DV certification services are susceptible to impersonation attacks²⁰^,²² and some root CAs do not restrict certification scope for their intermediate CAs.⁹ The high fraction of invalid and shared certificates might relate to carelessness regarding the Web PKI trust model (self-signed certificates) or additional (not only financial) configuration¹⁶ and certification costs. Higher costs and complexity are also the reason why certificate owners rarely make use of OV/EV certificates.

DNS registrars do not always offer DNSSEC by default or free of cost⁷ and not all ISPs bother to operate DNSSEC-aware recursive resolvers that properly verify signed DNS records.⁶^,²⁴ Missing DNSSEC support¹⁵ can partly be traced back to its complexity, for example, correct implementation of key transitions.¹⁸ This might also be the reason why some domain names under .gov do not support DNSSEC even though DNSSEC is mandatory under this TLD.

Provide comprehensive security assessment to end users. The complexity of secure communication must no longer be hidden from end users. Rather, there is now a more pressing need than ever before for users to be educated in better understanding the semantics of domain names¹⁹^,²¹ and Web PKI certificates and their practical use. Instead of the recent tendency by browser vendors to remove security signals, we must improve visual security cues and expand them in meaningful ways to different levels in Web browsers. Confusing TLS warnings² should be avoided and EV indicators should not be eliminated but enhanced.¹¹ Offering alternative CA trustworthiness assessment measures beyond the standard binary trust model⁴^,¹⁰^,¹³ could be an additional option. The security model proposed in this article illustrates a clear example of a multi-dimensional framework that is designed to quantify this complex landscape. Our model is well suited to being visually integrated into a concise mechanism to present meaningful signals to end users.

Integrate routing security. Our security model should next be enhanced to consider routing security. Threats such as route hijacking expose communication to eavesdropping and interception. When traffic is intercepted, attackers may undermine end-to-end security by using a rogue certificate or outright prevent Web services by dropping traffic. Such a Denial of Service (DoS) cannot be protected by end-to-end encryption at all.

Integrating routing in the assessment of security is important to provide a comprehensive view. Similar to the verification whether a domain name is protected by DNSSEC, an enhanced assurance score should be developed to include whether IP prefixes of Web servers or DNS name servers are secured by the Resource Public Key Infrastructure (RPKI).

Providing information from lower layers and granting control over those layers to the application layer can allow implementing extensions directly in browsers, enabling users to choose settings to reach higher assurance scores independently of preconfigured browser or operating system behavior. One approach could be advanced Path-aware Networking (PAN) combined with geofencing where Web applications select Internet paths that are under a specific jurisdiction.

Leveraging content object security. Current Web security relies on end-to-end security during transmission. This poses a challenge for two reasons. First, security channels need to be established per application endpoint, and, second, it introduces security dependencies when content distribution is provided by third parties (for example, Content-delivery Networks). One way to overcome these challenges is to secure the data itself instead of the communication channel. This is known as object security and has found attention in IoT (for example, OSCORE) and Information-Centric Networking (ICN). In this security model, data is enhanced with cryptographic features that enable self-certification or self-authentication, and is protected both while in flight and while at rest.

6 Conclusion and Outlook

In this article, we conceptualized a security model to algorithmically quantify and qualify trustworthiness in Web communication. This security model covers three security dimensions (resolution, transaction, identification), which are combined into Assurance Scores grouped into Assurance Profiles. Assurance Profiles provide a lean, yet powerful view on the security support of a specific Web service. They may serve as an advancement in future Web browsers to hide the complexities in secure Web communication by comprehensively notifying end users whether the level of security is strong, weak, or inadequate.

To showcase the applicability of our model in practice, we analyzed three classes of Websites that are relevant to the public, Alerting Authorities in the U.S., selected emergency management organizations in Germany, and input for the Diplomatic Pulse, a search engine for official press releases of UN Member States. We measured the deployment of common technologies providing potential security features (namespaces, DNSSEC, and Web PKI) for these Websites and embedded them in our model.

We uncovered deficiencies and discussed alternatives while emphasizing that common solutions are not necessarily technical but operational as well as political. Protecting Web-based communication, specifically in context of critical infrastructure and public safety, entails addressing operational and policy challenges on national and international levels and calls for commitment of all stakeholders from service providers to intermediate infrastructure operators and browser vendors alongside policy-makers.

In the future, when orthogonal security paradigms arise, such as content object security, or when emerging but fragmented edge networks, such as the Internet of Things, gain importance, the security model proposed in this article shall be extended, adapted, and applied. We urgently remind our community that complexity will always arise and demands for an automated and comprehensive security assessment integrated in end user software.

Acknowledgments

We would like to thank the Innovation Cell in the United Nations Department of Political and Peacebuilding Affairs (UN DPPA) for providing us the list of sources used in the Diplomatic Pulse database. We thank Jakob Bork for helping with the verification of German Emergency Service providers.

This work was supported in parts by the German Federal Ministry of Education and Research (BMBF) within the projects Deutsches Internet-Institut (grant no. 16DII111) and PRIMEnet.

Submit an Article to CACM

CACM welcomes unsolicited submissions on topics of relevance and value to the computing community.

You Just Read

A Security Model for Web-Based Communication

View in the ACM Digital Library

DOI

10.1145/3623292

October 2024 Issue

Vol. 67 No. 10

Pages: 83-90

Table of Contents

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Explore More

BLOG@CACM Dec 20 2024

Strengthening Security Throughout the ML/AI Lifecycle

Alex Vakulov

Artificial Intelligence and Machine Learning

News Dec 18 2024

iBuyers, AI, and Real Estate

Gregory Goth

Architecture and Hardware

BLOG@CACM Dec 17 2024

Zero-Trust Security in Software Development

Harikrishna Kundariya

Computing Profession

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More

Abstract

1 Introduction

2 A Security Model for Web-Based Communication

3 DNS Namespace Analysis

3.1 Dedicated Domain Names

3.2 Namespace Structure

3.3 DNSSEC Deployment

4 Web PKI Analysis

4.1 Current Deployment of Web PKI

4.2 Certificate Usage

4.3 Historic X.509 Certificate Landscape

5 How to Proceed Further?

6 Conclusion and Outlook

Acknowledgments

A Security Model for Web-Based Communication

DOI

October 2024 Issue

Related Reading

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

Shape the Future of Computing

Communications of the ACM (CACM) is now a fully Open Access publication.