;login: Summer 2019 R8 Test 2

SREcon19 Asia/Pacific

June 12–14, 2019, Singapore

www.usenix.org/srecon19asia

2019 USENIX Annual Technical Conference

July 10–12, 2019, Renton, WA, USA

www.usenix.org/atc19

Co-located with USENIX ATC ’19

HotStorage ’19: 11th USENIX Workshop on
Hot Topics in Storage and File Systems

July 8–9, 2019

www.usenix.org/hotstorage19

HotCloud ’19: 11th USENIX Workshop on
Hot Topics in Cloud Computing

July 8, 2019

www.usenix.org/hotcloud19

HotEdge ’19: 2nd USENIX Workshop on
Hot Topics in Edge Computing

July 9, 2019

www.usenix.org/hotedge19

SOUPS 2019: Fifteenth Symposium on
Usable Privacy and Security

August 11–13, 2019, Santa Clara, CA, USA

Co-located with USENIX Security ’19

www.usenix.org/soups2019

28th USENIX Security Symposium

August 14–16, 2019, Santa Clara, CA, USA

Co-located with SOUPS 2019

www.usenix.org/sec19

Co-located with USENIX Security ’19

PEPR ’19: 2019 USENIX Conference on Privacy Engineering Practice and Respect

August 12–13, 2019

www.usenix.org/pepr19

WOOT ’19: 13th USENIX Workshop on Offensive Technologies

August 12–13, 2019

www.usenix.org/woot19

CSET ’19: 12th USENIX Workshop on Cyber Security Experimentation and Test

August 12, 2019

www.usenix.org/cset19

ScAINet ’19: 2019 USENIX Security and AI Networking Conference

August 12, 2019

www.usenix.org/scainet19

FOCI ’19: 9th USENIX Workshop on Free and Open Communications on the Internet

August 13, 2019

www.usenix.org/foci19

HotSec ’19: 2019 USENIX Summit on Hot Topics
in Security

August 13, 2019

www.usenix.org/hotsec19

SREcon19 Europe/Middle East/Africa

October 2–4, 2019, Dublin, Ireland

www.usenix.org/srecon19europe

LISA19

October 28–30, 2019, Portland, OR, USA

Submissions due June 18, 2019

www.usenix.org/lisa19

Enigma 2020

January 27–29, 2020, San Francisco, CA, USA

Submissions due August 21, 2019

www.usenix.org/enigma2020

FAST ’20: 18th USENIX Conference on File and Storage Technologies

February 24–27, 2020, Santa Clara, CA, USA

Sponsored by USENIX in cooperation with ACM SIGOPS

Co-located with NSDI ’20

Submissions due September 26, 2019

www.usenix.org/fast20

NSDI ’20: 17th USENIX Symposium on Networked Systems Design and Implementation

February 25–27, 2020, Santa Clara, CA, USA

Sponsored by USENIX in cooperation with ACM SIGCOMM and ACM SIGOPS

Co-located with FAST ’20

Spring paper titles and abstracts due
September 12, 2019

www.usenix.org/nsdi20

USENIX Open Access Policy

USENIX is the first computing association to offer free and open access to all of our conference proceedings and videos. We stand by our mission to foster excellence and innovation while supporting research with a practical bias. Please help us support open access by becoming a USENIX member and asking your colleagues to do the same!

www.usenix.org/membership

EDITOR

Rik Farrow
rik@usenix.org

MANAGING EDITOR

Michele Nelson
michele@usenix.org

COPY EDITORS

Steve Gilmartin

Amber Ankerholz

PRODUCTION

Arnold Gatilao

Ann Heron

Jasmine Murcia

TYPESETTER

Star Type
startype@comcast.net

USENIX ASSOCIATION

2560 Ninth Street, Suite 215
Berkeley, California 94710
Phone: (510) 528-8649
FAX: (510) 548-5738

www.usenix.org

;login: is the official magazine of the USENIX Association. ;login: (ISSN 1044-6397) is published quarterly by the USENIX Association, 2560 Ninth Street, Suite 215, Berkeley, CA 94710.

$90 of each member’s annual dues is for a subscription to ;login:. Subscriptions for nonmembers are $90 per year. Periodicals postage paid at Berkeley, CA, and additional mailing offices.

POSTMASTER: Send address changes to ;login:, USENIX Association, 2560 Ninth Street, Suite 215, Berkeley, CA 94710.

©2019 USENIX Association

USENIX is a registered trademark of the
USENIX Association. Many of the designa-tions used by manufacturers and sellers to distinguish their products are claimed as trademarks. USENIX acknowledges all trademarks herein. Where those designations appear in this publication and USENIX is aware of a trademark claim,
the designations have been printed in caps
or initial caps.
Musings

Rik Farrow

Rik is the editor of ;login:.
rik@usenix.org

While musing, I like to wonder what it would be like to live in a world without buggy software. That is, a world very unlike the one we live in. As I write this, Boeing’s 737 MAX plane has been grounded, apparently because buggy software and not documenting its possible dangerous effects have killed over 300 people in two separate crashes. Businesses and home users regularly have their data encrypted by criminals demanding ransom. And whole countries are in turmoil via careful manipulation of opinion via social media.

I attend conferences looking for people with interesting and potentially useful ideas. I first met Kostya Serebryany at Enigma 2016, where I tried to get him to write about the work he has been doing in security. He deferred then. Kostya then contacted me in the Fall of 2018 excited about something I find exciting as well: adding security features to hardware. We’ve published articles from several authors about hardware features to improve security, as well as problems with hardware solutions, such as the ability to extract data from Intel’s secure enclave, Meltdown [1].

Kostya most recently has worked on fuzzing, techniques for probing programs for potentially exploitable bugs. In 2015, Peter Gutmann wrote about various fuzzing techniques, something that Kostya has long worked on, and that’s related to what he wrote about for this issue [2].

Weaknesses in C/C++
I’ve long joked that C was a macro-assembly language: a convenience layer for those who needed to write code near to the speed of assembly [3], but with the convenience of variable labels, for loops, subroutine call handling, and structures. When I first encountered C, I immediately fell in love with structures, as the concept made some of the things I needed to do so much clearer than calculating offsets in assembler would have been. And, to be honest, I was really bad at calculating offsets. C beat the hell out of writing in Intel assembly (or VAX or Motorola assembler too).

But C and C++ lack certain safety features found in modern languages like Java, Go, Swift, and certainly Rust. In C and C++, you could specify array indices far beyond the end of the array you’d locally allocated, leading to buffer overflows on the stack. You could do this as well in the heap, and you could also do this with pointers into memory. I consider C and C++ to be languages for expert programmers, because they made it so easy to do the wrong thing. I always assumed that the authors of these languages were highly intelligent and expert programmers themselves, and that they had written these languages for their own convenience. In the case of C, that was certainly true, although the authors would be sharing C with other Bell Labs employees and, eventually, professors at various universities.

Bjarne Stroustrup, also at AT&T Bell Labs, came along a bit later, added classes to C, but kept all its wonderful and dangerous flaws. That is, you could create classes and instantiate objects, but you could also overrun arrays, leak memory, and abuse pointers.

Smashing the Stack
The Internet Worm really made people aware of the danger of buffer overruns. The finger daemon used the C function gets(), which collects a string into an array previously allocated but doesn’t check to see whether the length of the array is sufficient. This function still exists in libc, and the man page includes the warning, “Never use this function.” Makes you sort of wonder why it’s still there.

I learned much more about smashing stacks from Elias Levy’s famous article about buffer overflows [4]. I recreated the finger daemon for class exercises and gave students short C programs they could use to attack the finger daemon, whose real purpose was to run the who command and return the results over the network. When correctly exploited, the attack would instead run /bin/sh.

And this was only part of the problem with C and C++. There were also ways to exploit file structures that contain pointers to functions, or to use a little known option of format() to carefully overwrite portions of the stack, allowing exploits that used Return Oriented Programming (ROP). And this is just a partial list.

There are other issues with C/C++ that have to do with pointers. Using malloc() returns a pointer to a block of memory, and free() releases that block. But it’s quite common for programmers to either forget to free memory (a memory leak) or to use a pointer to memory after it had been freed (use-after-free).

During the first time I met Kostya, he showed me dozens of places in the Linux kernel where memory was used after it was freed and was still unpatched upstream. I could tell he was agitated about this.

Today C and C++ are the second and third most popular programming languages (as of April 10, 2019) in the Tiobe Index [5]. Looking at language popularity in another way, I asked Chris Wysopal of Veracode about how many programs in various languages that they analyze each year, and Chris provided me with the diagram in Figure 1. Veracode’s numbers, based on the thousands for binary programs analyzed, present a different picture, where C/C++ is less popular.

I found myself wishing that C would just go away, but Kostya assured me that that’s not going to be happening, as IoT devices will use slower CPUs and have less memory, and they are going to need compact and fast languages. Damn.

The Lineup
Jasmine Peled, Bendert Zevenbergen, and Nick Feamster have written a column about ethics, regarding something I had never heard of, called mcTLS. You might think that something with TLS in its name has to do with encrypting Internet traffic, and you’d be right. However, mcTLS has to do with creating a method so that TLS can be decrypted by middle boxes. If you think this is a bad idea, Peled and her co-authors agree with you, and explain why even the initial researchers should have considered this. Note that the IETF isn’t happy about mcTLS either, mainly because including TLS in the name violates copyright as well as having the ability to confuse people about their Internet traffic actually being secure.

Kostya Serebryany has written about a security extension in hardware, something I consider a wonderful idea (in case you skipped the earlier part of this column). Sun, now part of Oracle, first came up with the notion of including tags to help prevent a variety of bugs and the successful exploitation of those bugs, and now ARM plans on doing this as well.

I interviewed Mark Loveless, aka Simple Nomad. I’ve known Mark for many years, and we got together during Enigma ’19 to chat and begin this interview. Mark is definitely someone you should call a hacker, unlike Beto O’Rourke, whose membership in the Cult of the Dead Cow predates most of the cDc’s hacking activities. Mark has interesting stories to tell.

Anuj Kalia, Michael Kaminsky, and David Andersen have written about eRPC. You might recognize the authors’ names from an earlier article about RDMA. This article, like the first one, is based on a paper, this time at NSDI ’19. While their paper takes a deeper dive, Kalia et al. explain how this open source RPC library can be faster than those that rely on niche networking technologies.

Daniel Bittman, Peter Alvaro, Darrell Long, and Ethan Miller write about how to avoid bit-flipping in programming data structures. Based on a FAST ’19 paper, Bittman et al. explain why bit-flipping may be considered harmful for persistent memories, like Micron’s XPoint. But what I particularly like about their work is that it offers a different way of thinking about, and using, traditional data structures like linked-lists and B-trees that is often faster—and involves smaller structures and fewer bit flips.

Vladimir Legeza and Anton Golubtsov tell us how to make logging much more useful. Legeza, now working at Google, and Golubtsov (Amazon) suggest what should be commonsense methods for having standards for your logging messages. Legeza first suggested this idea as an opinion article, but I consider it much more along the line of best practices. I wish I had read such an article 35 years ago!

Laura Nolan considers complexity, taking a different perspective from Dave Mangot’s “Boring Tech” article [6] in the Spring 2019 issue. Laura first describes what is meant by software complexity, then how systems complexity differs from the software version. Laura does a great job, and she has volunteered to write columns about SRE issues.

Peter Norton has written about how you can use a tool based on Python to create portable configuration files. The external format is YAML, and the code performs static type checking, helping to prevent errors in configuration.

Mac McEniry decided to cover the use of password managers. Mac has previously written about Hashicorp’s Vault (Winter 2017) [7], but this time around he looks at three different Go libraries for secure storage of passwords for use by applications: Keychain (Mac), Windows Credential Manager, and a library called keyring that will work on Linux and the other OSes as well.

Dave Josephsen considers just how weird and wonderful it is to be living in the middle of nowhere in Montana. Then Dave gets down to business and begins explaining why he likes Prometheus for monitoring so much and how it’s used.

Dan Geer ponders about just how common exploited software bugs might be. Working from various data sources, Dan tells us that the problems with software bugs are much worse than you likely suspect, and even worse than I imagined.

Robert Ferrell suggests that we tone down our expectations for technology. After all, flying cars are still experimental, and even Amazon has decided that having a special button just for ordering laundry detergent might not be the best use of technology.

Mark Lamourine has written three book reviews, covering Refactoring (second edition), Concurrency in Go, and Cloud Native Go. I reviewed David Clark’s Designing an Internet, and also wrote two short reviews of books for summertime reading: Marcia Bjornerud’s Timefulness and Max Gladstone’s Empress of Forever.

In Closing
There are problems with all programming languages. For example, while Rust is much safer by design, you can write Rust code in unsafe mode, disabling its safety features. Java does checks and prohibits array overruns, but the JVM is written in C++, and it has had numerous vulnerabilities over the years.

I also asked Chris Wysopal if he could tell me what proportion of exploitable bugs came from code that processed input, and he answered 75%. If you’ve been reading ;login: for the last five years, you will have noticed, and hopefully read, many articles relating to LangSec, for example [8, 9]. LangSec, roughly, is the notion that security could be tremendously improved by paying more attention to input parsing, and Chris’s comment about the majority of vulnerabilities coming from input parsing problems supports this.

When I heard about LangSec and learn about efforts to create better support for security in hardware, I imagine that the problem of software insecurity will soon be solved. But I am forgetting several things.

First, most programmers are, by definition, of average skill level. Second, few programmers know much about security, and far fewer have a clue about LangSec. Third, some protocols, like the text (versus binary) version of X.509 certificates, cannot be parsed securely because the design requires a complex parser. And finally, even when ARM or Intel produce security features that will greatly reduce successful exploits, most people won’t enable them, either because they don’t understand them or because such features cause programs to fail sometimes—an indication of programming flaws they’d prefer to ignore.

References
[1] D. Gruss, D. Hansen, B. Gregg, “Kernel Isolation: From an Academic Idea to an Efficient Patch for Every Computer,” ;login:, vol. 43, no. 4 (Winter 2018): https://www.usenix.org
/publications/login/winter2018/gruss.

[2] P. Gutmann, “Fuzzing Code with AFL,” ;login:, vol. 41, no. 2 (Summer 2016) : https://www.usenix.org/publications/login
/summer2016/gutmann.

[3] Wikipedia, “Assembly Language: Macros,” last modified on March 25, 2019: https://en.wikipedia.org/wiki/Assembly
_language#Macros.

[4] E. Levy, “(Aleph One), Smashing the Stack for Fun and Profit,” Phrack, vol. 7, no. 49: http://phrack.org/issues/49/14.html.

[5] Tiobe Index, April 2019: https://www.tiobe.com/tiobe-index/.

[6] D. Mangot, “Achieving Reliability with Boring Technology,” ;login:, vol. 44, no. 1 (Spring 2019): https://www.usenix.org
/publications/login/spring2019/mangot.

[7] C. McEniry, “Go: HashiCorp’s Vault,” ;login:, vol. 42, no. 4 (Winter 2017): https://www.usenix.org/publications/login
/winter2017/schock.

[8] S. Bratus, M. Patterson, and A. Shubina, “The Bugs We Have to Kill,” ;login:, vol. 40, no. 4 (August 2015): https://www.usenix
.org/publications/login/aug15/bratus.

[9] J. Bangert and N. Zeldovich, “Nail: A Practical Tool for Parsing and Generating Data Formats,” ;login:, vol. 40, no. 1 (February 2015): https://www.usenix.org/publications/login/feb15/bangert.
The Man in the Middlebox: Violations of End-to-End Encryption

Jasmine Peled, Bendert Zevenbergen, and Nick Feamster

Jasmine Peled currently works on computer network analysis at the Department of Defense. She recently graduated from Princeton University, where she studied computer science and philosophy. Her work at Princeton focused on how undergraduate computer science courses can better incorporate material about ethics in order to encourage students to consider the ethical and societal implications of the technologies they develop. Jasmine’s senior thesis, “Towards a Pedagogy of Principles: Teaching Ethics in Computer Science,” received Princeton’s Outstanding Senior
Thesis Award. jasminepeled21@gmail.com

Ben Zevenbergen is a visiting professional specialist at the Center for Information Technology Policy at Princeton. His work mostly consists of multidisciplinary investigations in the ethical, social, and legal impacts of Internet technologies, and vice versa. At CITP Ben is working on the engineering ethics and political theory impacts of artificial intelligence. Ben is currently finishing a PhD at the Oxford Internet Institute about the research ethics for technical projects that involve unsuspecting Internet users as data subjects.
benzevenberger@princeton.edu

Nick Feamster is a Professor in the Computer Science Department at Princeton University and the Deputy Director of the Princeton University Center for Information Technology Policy (CITP). He was formerly a Professor at Georgia Tech, and received his MEng and PhD degrees from MIT. He has won many awards for his networking research, at ACM SIGCOMM, IMC, and USENIX NSDI. Nick is also an avid distance runner, having completed nearly 20 marathons and the Comrades ultra-marathon in South Africa.
feamster@cs.princton.edu

We consider the ethical issues of the paper “Multi-Context TLS (mcTLS): Enabling Secure In-Network Functionality in TLS” [8], which presents a method to extend the Transport Layer Security (TLS) protocol to allow it to support middleboxes. Specifically, to what extent should third parties be able to decrypt traffic between two Internet endpoints for various purposes, ranging from performance to security? This is the first column in a series about ethics that we hope will encourage ongoing discussion and debate in the research community about ethical considerations that may arise in the course of networking, security, and systems research.

Ongoing research in the computer science communities of security, privacy, and networking investigates and develops network applications and appliances that may improve Internet performance and security, often by modifying traffic en route between two Internet endpoints. Middleboxes constitute one such example of this capability; middleboxes are defined as “any intermediary box performing functions apart from normal, standard functions of an IP router on the data path between a source host and destination host” [1]. Middlebox functionality includes transcoding videostreams to different bit rates or detecting attacks, often through inspection of the contents of a packet’s payload.

Because some of this functionality can require inspecting the contents of network traffic, these middleboxes may need to break end-to-end encryption, decrypting traffic midstream to facilitate operating on packet contents. mcTLS describes mechanisms for breaking the end-to-end encryption of TLS specifically to enable middleboxes to view and edit data and metadata.

Middleboxes and End-to-End Encryption
The rise of end-to-end encryption is generally heralded as a positive development, as it protects both the integrity and confidentiality of communications between Internet endpoints, thus protecting sensitive transactions and preserving user privacy.

On the other hand, if traffic is encrypted, conventional middleboxes have difficulty performing any operation that depends on seeing packet contents. In response, researchers have grappled with this problem in various ways [6]. One approach involves developing techniques that can still operate on encrypted traffic, including techniques that can perform operations on packet headers alone [5] or limited types of operations on encrypted messages [11]. Yet, certain types of operations that require deep packet inspection may be either inefficient or ineffective when payloads are encrypted; thus, another approach involves developing a “backdoor” of sorts that allows an Internet service provider (ISP) to decrypt encrypted communications in flight.

ISPs have developed an increased interest in deploying middleboxes that perform operations on traffic that is en route between source and destination. For example, ISPs often deploy middleboxes that perform intrusion detection and detect a range of different types of attacks; these middleboxes may also perform certain performance optimizations, such as transcoding a videostream to a lower bit rate or performing other types of optimizations (e.g., WAN acceleration, load balancing). These operations may depend on at least inspecting traffic contents; in some cases, the traffic contents may even be modified.

Multi-context TLS (mcTLS) is one such technology; it permits ISPs to decrypt secure, end-to-end sessions of TLS Internet traffic by third parties, allowing them to control, read, and write the data in the communications. The authors of the paper [8] outline several technical advantages to mcTLS:
- � In-network functions may be more effective at scale, in contrast to relying on endpoint-based functionality alone.
- � Middleboxes may be useful for both users and service operators in terms of speed and data storage.
- � Middleboxes may help protect personal information by acting as a watchdog over applications that may leak data unwittingly.
mcTLS is based on the premise that, just like end-to-end encryption, middleboxes are a “useful part of the Internet and are here to stay.” More generally, the question of whether (and how) middleboxes should have access to encrypted communications is under active discussion in industry standards organizations, such as the Internet Engineering Task Force (IETF) [7].

A natural question concerns whether the increased in-network capabilities that result from breaking end-to-end encryption offer benefits that outweigh the risks of harm to stakeholders. A related question concerns whether the development and deployment of such research should focus on technologies that weaken end-to-end encryption in favor of potentially improved security and performance, versus technologies that can operate on traffic with encrypted payloads, potentially with reduced effectiveness.

The Appropriate Ethical Lens
Ethical analysis can take many forms, which are best understood on a spectrum. On one end of the spectrum is normative ethics—as practiced in academic philosophy—which studies reasoning methods such as utilitarianism, deontology, and virtue ethics. Ethics compliance frameworks such as research ethics or medical ethics—which consist of more formal procedures for specific professions—are on the other end of the spectrum. In between these two approaches to ethics are several other, more applied types of ethics sub-disciplines, such as information ethics, technology ethics, computer ethics, data ethics, bioethics, animal ethics, among many others. Compliance-ethics frameworks typically consist of “check-box exercises” that may be rooted in law; applied ethics have some generally agreed upon methodologies for reasoning about sectors of society; and normative ethics studies the reasoning methods themselves. For this article, it is relevant to establish whether man-in-the-middle technologies such as mcTLS should be analyzed through the lens of research ethics or through a different approach.

The framework of research ethics is typically an appropriate lens for an academic paper. This framework is commonly applied to a study or experimentation when (1) it presents research in the formal sense, and (2) when the research is conducted with human subjects.
In the United States, research in the formal sense is defined in the US Code of Federal Regulations on the Protection of Human Subjects as a “systematic investigation, including research development, testing and evaluation, designed to develop or contribute to generalizable knowledge” [10].

Once it has been established that a given paper constitutes research, the next question is whether the authors conduct research on human subjects. Formal regulations on the protection of human subjects in research apply to persons who conduct research (e.g., the Common Rule [2]). Although security and networking researchers typically see themselves as conducting research on technical systems, the Internet is more properly understood as a socio-technical system in which humans and technology interact. Humans will often be implicated in data collection.

The mcTLS technology aims to intercept the Internet traffic of humans, though the paper discussed in this column merely proposes a novel functionality but does not actually present data from experimentation on live Internet traffic. Instead, the paper presents the research and development of a new technology. Therefore, the formal framework of research ethics (such as the Common Rule) need not be applied. However, even when formal requirements do not demand research conform to a research ethics checklist, researchers should still assess the broader ethical impact of their work. After all, research that does not constitute “human-subject research” may still affect people, and this series of columns seeks to bring to mind some questions that researchers should be asking themselves.

Research into computers and networked systems have traditionally challenged the principles laid out in existing research ethics procedures, such as the Belmont Report. In response, several computer science communities embraced the Menlo Report [3], which interprets the principles of the Belmont Report [4] and applies them to computer and information security and measurement research specifically. Additional networked systems ethics guidelines were developed through lengthy processes of reflection and iteration in workshops by scholars from many different disciplines [9]. Because the Menlo Report is more applicable to experimentation with human subjects on the Internet, the analysis in this article will lean on the concepts presented in Networked Systems Ethics Guidelines [9].

Technology Ethics Analysis
The Networked Systems Ethics Guidelines suggest that researchers aim to understand a technology within the social context where it operates. This social context includes an analysis of the stakeholders, the aims, benefits, risks of harm, meaning of collected data in context, shifts in power, and an understanding of the affected values. The guidelines then suggest analyzing the impact of the values on stakeholders and the socio-technical environment, the values themselves, and any foreseeable unintended consequences. It is useful to link these analyses to the technical sources of the original design. When the impact of technical alternatives have been considered in minimizing risks of harm, the guidelines suggest managing the residual risks through information governance methods, also known as responsible data stewardship. We will preface each section with a question from the guidelines.

Aims and Benefits
What are the aims and benefits of the project? How will the research benefit society and specific stakeholders?

The technology presented in the mcTLS paper [8] realizes a technology to intercept, analyze, and possibly manipulate Internet traffic that has been encrypted on an end-to-end basis. The proposed tool would replace previous “hacks,” which ostensibly decrease security in the existing all-or-nothing security model. The authors state the aims of the mcTLS project concretely as follows: (1) to optimize network resource usage, (2) to improve user experience, and (3) to protect clients and servers from security threats. This tool would only be applied with the consent of all the parties involved in the connection.

Naylor et al. [8] state some further benefits that could be considered as secondary goals. For example, the authors mention that the in-network services may increase competition, innovation, and choice for end-users. Another stated benefit is that the use of middleboxes may reduce energy consumption by all stakeholders on the Internet.

The aims and benefits appear to be presented from the point of view of an ISP or network operator. The interests of end-users on the Internet are scarcely considered. The second-order benefits to society are difficult—if not impossible—to prove or support with evidence, and the paper does not consider some of the unintended social harms that may result from this tool, particularly the fact that breaking end-to-end encryption in this way will give the network operator complete power to read users’ Internet traffic.

Privacy
Which definitions or explanations will be used to assess a value? Is the risk of harm high, medium, or low?

The interception and possible processing and dissemination of end-users’ Internet traffic data may be considered a violation of their privacy. The concept of privacy is vague and illusive, however, and has thus been difficult to define precisely. Privacy may be best understood as an umbrella term referring to a group of related concepts, issues, and values that protect the individual’s private life from intrusions by others. The use of mcTLS on end users’ encrypted traffic violates the sub-category of information privacy, especially if their data is further processed or disseminated to third parties.

Privacy violations can be harmful in immaterial ways, though they may also reveal information about persons that can lead to physical, financial, reputational, or other types of harm, depending on the actor who receives the information and decides to act upon it. Different types of information have different types of impact on persons when revealed, depending on the context. Given the mediating role of the Internet to support modern life, encrypted Internet traffic intercepted by mcTLS will likely contain a large variety of information types, concerning a large and diverse set of persons.

To assess the risk of harm, one must consider the type of attacker who may be interested in the information that mcTLS may expose, the level of technical sophistication they have, what actions could be taken based on the new knowledge, and what the consequences would be for an Internet user. Given the large amount of Internet traffic generated by a variety of end-users that mcTLS could intercept, all types of attackers—from individual hackers to well-resourced government surveillance actors—should be taken into account. Further, mcTLS creates a point of failure for a variety of actors to gain access to Internet traffic through both security vulnerabilities and traditional legal procedures.

Further, mcTLS poses threats to privacy by altering the context in which certain information is processed and handled. Information that may be acceptable for both endpoints of communication to view should not necessarily be shared with third parties. For example, a user may choose to enter Personally Identifiable Information (PII) into a healthcare site in order to receive personalized care, but sharing this information in one context does not constitute approval for their ISP to share it with other companies. This could violate the Health Insurance Portability and Accountability Act (HIPAA), as well as the trust that users place in their ISP to keep communications and data private.

Due to the large variety of users, stakeholders, and their purpose for using the Internet, it is nearly impossible to generalize the risk of harm and define it precisely and meaningfully. This makes it especially challenging to assess the ethical tradeoffs presented by an emerging technology. Further, what may be considered harmless today may become a much larger threat in future. For example, the creation of new data sets may allow identification of Internet users in ways that cannot be foreseen today.

Violations of end-user privacy may be justified to some extent by gaining their consent or when serving the greater good. However, an informed consent notice or other justifications should be based on factual information and informed assessments rather than self-serving arguments of increased efficiency. The complex and international nature of the Internet complicates such an analysis, because risks of privacy harm should first be defined and identified for all affected Internet users in their contexts. This is, of course, a near impossible task.

Autonomy, Consent, and Choice
Do you need to rely on informed consent from participants and stakeholders? Which stakeholders carry the burdens of the study?

The Belmont Report gives guidance regarding the respect for autonomy, balancing the value of autonomy with the interests of others:

“To respect autonomy is to give weight to autonomous persons’ considered opinions and choices while refraining from obstructing their actions unless they are clearly detrimental to others” [4].

To achieve the aims and deliver the benefits identified in the paper, the existing security that users currently enjoy due to end-to-end encryption will be violated. Of course, most Internet users may not have a full understanding of the security mechanisms currently in place or even awareness of the existence of end-to-end encryption in the first place. This situation raises the question of whether taking away a good that users enjoy unwittingly as a means to achieve another end—the relative benefit of which is itself debatable—is a valid justification.

Informed consent is widely considered to be a mechanism that operationalizes the concept of autonomy of Internet users. Indeed, the authors state that both endpoints of a connection within which an mcTLS is deployed must consent to its use. However, similarly to the realm of healthcare, a key aspect of informed consent is being informed of reasonable alternatives to the proposed action. In the context of mcTLS, respect for autonomy may be understood as the obligation to fully inform an Internet user of the benefits and risks of harm in their particular context. The rejection of these benefits and risks of harm should not lead to a suspension of their Internet connection but possibly to access an alternative network within which the mcTLS tool is not operational.

Alternatively, an ISP or network operator could choose to base the legitimacy of the increase in power on a more paternalistic approach, whereby they interpret their duty of care to justify the use of mcTLS, along with its benefits and risks of harm. This constitutes a use of power over Internet users that may require some balancing through accountability mechanisms (see the Accountability section, below). For example, the ISP or network operator may choose to publish their considered justification for the use of mcTLS in their network, along with a technical description that allows some auditing of their system, as well as an information governance (or data stewardship) statement to which it can be held accountable by end-users. It is critical, though, that these explanations of benefits and potential harms posed by mcTLS do not simply use technical jargon to scare off the average user from understanding the full implications of middlebox technologies, so that supposed informed consent is, in fact, informed.

Many of these ethical concerns regarding privacy, autonomy, and choice could be resolved through agreements between ISPs and users about whether mcTLS will be implemented and how user data will be used. However, the next two sections present ethical challenges to the deployment of mcTLS which do not have such clear solutions.

Stakeholders and Power Shifts
Are particular stakeholders empowered or disempowered as a result of this project? Which values will the project conceivably impact?

ISPs and network operators will be the actors that implement and have access to mcTLS; these actors ultimately make the decision to implement and deploy such systems. These actors already have significant power over information flows, as the de facto gatekeepers to the Internet with the ability to control, manipulate, and, in some cases, observe data flows between their subscribers and other sites on the Internet. mcTLS further amplifies their power over Internet users, giving them the ability to observe the contents of network communications that might otherwise have been encrypted.

Internet users, on the other hand, will be disempowered over the collection and use of their data. Once a user has given consent to the use of mcTLS on their traffic, it will be difficult to control how their Internet traffic is collected, processed, and further disseminated, which may result in a violation of privacy. An informed consent notice referring to end-to-end encryption and the functionality of mcTLS is unlikely to be meaningful to most Internet users. First, an informed consent notice is unlikely to give the end-user meaningful information regarding the creation of a single-point-of-failure within their Internet traffic and the possible attackers or interested parties that may subsequently gain access to their data. Further, a rejection of the mcTLS tool on their Internet traffic may lead the ISP or network operator to suspend Internet access of the end-users, thereby offering users a choiceless choice (or Hobson’s choice) whereby the user is asked to agree with a technically complex violation of their encrypted end-to-end connection. This may constitute a violation of their autonomy.

The mcTLS paper does not differentiate between Internet users in its analysis of benefits and harms. It is important to note that the benefits to some users can result in vastly increasing risks of harm for other users. For example, the use of middleboxes on the Internet traffic of oppressed peoples or whistleblowers in countries where the rule of law is not as effective as the authors’ home country should be considered.

Unintended Consequences
Does the project potentially set a precedent for unethical methodologies that could be misused by others in the future?

Although developers of new technologies may not be directly responsible for misuses of their products under the law or under typical “checklist” research ethics restrictions, developers should still take care to mitigate potential unintended negative consequences. It is therefore important that researchers engage actively with the possibility that their methods and technologies may be misused, and design ways to mitigate those identified risks and harms. The most common ways projects influence future malevolent technology uses is through function creep and precedent setting. The following questions can help address the future concerns of creating a technology that enables a so-called “back door” into end-to-end encrypted Internet traffic.

Function creep occurs when functionality of a technology is used for other purposes than for which it was originally intended. Researchers and developers may want to consider for which other—more malevolent—aims the mcTLS technology can be used. It is relevant to consider a wide array of threat actors that would have an interest in using mcTLS for their own aims. When even companies such as Experian and Equifax are unable to keep their data secure, it is important to consider whether users can truly expect ISPs to protect their information and how adding a third party complicates this. How could the developers mitigate these foreseeable malevolent uses through their technical design?

Precedent-setting occurs when other researchers or developers can point at the use of mcTLS’s technology or functionality to justify the development and use of new technologies. Technology is typically a double-edged sword that can be used for both good and bad purposes. It is therefore important to interrogate the use of precedents critically. Developers should consider how other future malevolent developers can utilize the existence and use of mcTLS to justify the development and use of technologies that cause more harms. For example, does the interception of end-to-end encrypted traffic by ISPs for efficiency in finding malware justify the interception of encrypted traffic to create profiles of Internet users for law enforcement?

When the risks of harm to stakeholders and potential unintended consequences have been identified, the researchers may pinpoint the technological causes of harms. For example, the main cause of harms is the creation of a back door and concentrated point of access for encrypted Internet traffic. Researchers should consider ways to address these issues and justify why alternative designs (or not acting at all) may be most beneficial.

Accountability
Which measures are taken to allow affected stakeholders to address concerns effectively?

Accountability is the concept that allows actors to be held liable or answerable for their actions. When an actor gains power over other stakeholders from the introduction of a technology, and the new actions may violate particular values, this increase in power should be accompanied by an increase in accountability. Accountability thus serves as a rebalancing mechanism.

Several governance mechanisms exist to allow for the exercise of accountability. For example, data governance policies can include codes of practice for employees and organizations within a sector to limit the extent to which technologies may be (mis)used. Other mechanisms include a statement of data collection policies, data retention periods for collected data, mitigation strategies for unforeseen risks, and limits on the further use or dissemination of collected data. Technical measures include information security strategies, de-identification of collected data, and further encryption of retained data. Meaningful accountability can be achieved when an organization is transparent about these policies and technical choices, as it allows third parties to audit and limit the exercise of power.

Conclusion
The introduction of technology in an environment will inevitably empower some actors over others. This is also true for mcTLS, a tool that breaks the end-to-end encryption of Internet traffic to achieve some beneficial ends, such as increased efficiency in identifying and solving security issues. However, the means by which these ends are achieved may conceivably cause harms to individual Internet users due to the shift in power over Internet traffic. End-users’ autonomy and privacy are likely violated, which have further social consequences. The developers may explore options to remedy these violations through technical means. However, not all problems are solvable through technology. Therefore, the actors who employ a technology such as mcTLS should consider rebalancing their newly gained power over Internet users with accountability mechanisms, allowing for transparency (and audibility) of the systems and clear information governance policies to which affected parties can hold the operators to account.

References
[1] B. Carpenter and S. Brim, “Middleboxes: Taxonomy and Issues,” 2002: https://tools.ietf.org/html/rfc3234 .

[2] “Federal Policy for the Protection of Human Subjects (‘Common Rule’),” 1991: https://www.hhs.gov/ohrp/regulations-and
-policy/regulations/common-rule/index.html.

[3] D. Dittrich and E. Kenneally, “The Menlo Report: Ethical Principles Guiding Information and Communication Technology Research,” US Department of Homeland Security, 2012: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=
2445102.

[4] National Commission for the Protection of Human Subjects of Biomedical and Behavioral Research, “The Belmont Report: Ethical Principles and Guidelines for the Protection of Human Subjects of Research,” Department of Health, Education, and Welfare, 1979: https://www.hhs.gov/ohrp/regulations-and
-policy/belmont-report/read-the-belmont-report/index.html.

[5] G. Gu, R. Perdisci, J. Zhang, W. Lee, “BotMiner: Clustering Analysis of Network Traffic for Protocol-and Structure-
Independent Botnet Detection,” in Proceedings of the 17th
USENIX Security Symposium (USENIX Security ’08), pp. 139–154: http://static.usenix.org/events/sec08/tech/
full_papers/gu/gu
_html/.

[6] K. Moriarty, “TLS Security and Data Center Monitoring: Searching for a Path Forward,” August 2017: https://www
.rsa.com/en-us/blog/2017-08/tls-security-and-data-center
-monitoring-searching-for-a-path-forward.

[7] K. Moriarty and A. Morton, “Effects of Pervasive Encryption on Operators,” 2018: https://tools.ietf.org/html/draft-mm-wg
-effect-encrypt-14.

[8] D. Naylor, K. Schomp, M. Varvello, I. Leontiadis, J. Blackburn, D. R. López, K. Papagiannaki, P. R. Rodriguez, and P. Steenkiste, “Multi-Context TLS (mcTLS): Enabling Secure In-Network Functionality in TLS,” in ACM SIGCOMM Computer Communication Review, vol. 45 (August 2015), pp. 199–212.

[9] “Networked Systems Ethics—Guidelines,” last modified on July 10, 2017: http://networkedsystemsethics.net/index.php
?title=Networked_Systems_Ethics_-_Guidelines.

[10] “Code of Federal Regulations, Title 45, Public Welfare, and Part 46, Protection of Human Subjects,” Department of Health and Human Services, 2009: https://www.hhs.gov/ohrp/sites
/default/files/ohrp/policy/ohrpregulations.pdf.

[11] J. Sherry, C. Lan, R. A. Popa, and S. Ratnasamy, “Blindbox: Deep Packet Inspection over Encrypted Traffic,” in ACM SIGCOMM Computer Communication Review, vol. 45 (August 2015), pp. 213–226.
ARM Memory Tagging Extension and
How It Improves C/C++ Memory Safety

Kostya Serebryany

Konstantin (Kostya) Serebryany is a Software Engineer at Google. His team develops and deploys dynamic testing tools, such as AddressSanitizer, MemorySanitizer, ThreadSanitizer, and libFuzzer. Prior to joining Google in 2007, Konstantin spent four years at Elbrus/MCST working for Sun compiler lab and then three years at Intel Compiler Lab. Konstantin holds a PhD from Moscow State University of Economics, Statistics, and Informatics and an MS from Moscow State University.
kcc@google.com

I discuss memory safety bugs typical to C and C++, current tools and approaches to finding such bugs or mitigating their risk, and a new
hardware feature, ARM MTE, that promises to be the biggest improvement since the introduction of page protection.

Memory (Un)safety
More than 30 years after the Internet Worm, we are still talking about memory safety bugs in C and C++ programs. Numerous improvements in the software development process are dwarfed by the exponential increase in the amount of software, its exposed attack surface, and the discovery of new attack techniques.

Memory safety bug is an umbrella term to represent program defects inherent in C and C++ but also present in other languages. The most common classes of bugs are buffer overflows, heap-use-after-free, and stack-use-after-return.

These bugs often make the code vulnerable to exploitation. Malicious actors can leverage memory-unsafe behavior to remotely execute code, leak sensitive information, escalate privileges, or escape VMs. A buffer overflow in OpenSSL, nicknamed Heartbleed, achieved notoriety for its ease of exploitation and high impact. It allowed attackers to steal a server’s private memory, including cryptographic information such as keys and passwords, without being detected. But named bugs like Heartbleed and Stagefright, a family of remotely exploitable bugs in Android, are just the tip of the iceberg.

Thousands of memory safety bugs are filed as CVEs every year. Roughly two-thirds of all CVEs in the Android platform are memory safety bugs. A similar picture is seen across the industry, affecting browsers, operating systems, and server-side and IoT software [1, 2]. And even these bugs are still the tip of the iceberg. Many more bugs do not get CVEs assigned, and many others remain unknown to software vendors. Some are being silently exploited, others cause hard to detect data corruption, and some lie dormant waiting to strike.

Typical Bugs
Before we dive deeper, let’s take a closer look at two of our most beloved insects.

A heap-buffer-overflow happens when an object of a certain size is allocated on the heap, and then a pointer to this object is used to access memory outside of the object bounds. Typically, the object is an array of n elements, and the code accesses the i-th element where
i < 0 or i >= n.

int *array = new int[n]; // heap allocation

array[n] = 42; // buffer overflow

array[-1] = 42; // buffer overflow (underflow)

array[100500] = 42; // buffer overflow, assuming n <= 100500

A heap-use-after-free happens when an object is allocated on the heap, and later deallocated, but a pointer to the object is preserved somewhere and is used to access the deallocated memory.

Object *obj = new Object; // heap allocation, or “malloc”

delete obj; // heap deallocation, or “free”

obj->member = 0; // heap-use-after-free, or

// access via a dangling pointer

In both cases the buggy memory access touches someone else’s memory. In the C and C++ standards this is considered undefined behavior. In real life it may cause a loud crash, a
silent data corruption, or a convenient back door.

Existing Tools and Practices
We haven’t been exactly ignoring the problem for 30 years.

Coding practices and testing tools reduced the likelihood of introducing a memory bug. A test-driven development process together with dynamic testing tools like AddressSanitizer [3] or Valgrind will help avoid many bugs. Fuzzing (and, ideally, fuzz-driven development [4]) will pick up the next layer of bugs. Some memory bugs can be spotted by static analysis.

Software-based code-hardening techniques make it harder for attackers to exploit memory safety bugs that reach production. Stack cookies, non-executable memory, ASLR, control flow integrity (LLVM CFI, Microsoft CFG, Shadow Call Stack), and other techniques help prevent memory safety bugs from diverting program control flow, the end goal of many exploits. Hardened memory allocators, such as Scudo Hardened Allocator or Chrome’s Partition Alloc, frustrate exploitation and may make it impossible in some cases.

Hardware-based solutions have begun to appear as well. ARM Pointer Authentication, already available in the most recent Apple hardware, cryptographically authenticates return addresses and discourages attackers from using return-oriented programming (ROP). Intel Control-flow Enforcement Technology is expected to appear soon to solve ROP in a different way, by keeping the return address on a separate stack with special permissions.

All these tools are making our software more stable and secure, but they are not enough. No amount of testing guarantees the absence of bugs, and existing exploit mitigations only prevent some attacks, while almost entirely ignoring others, e.g., data-oriented attacks.

Among the hardware-based solutions two stand out, SPARC ADI and ARM MTE, both implementations of a concept known as memory tagging or memory coloring. SPARC ADI has been available in mass-produced hardware since 2016; we covered this feature in an earlier paper [5]. This article focuses on ARM MTE.

ARM MTE
On September 2018 ARM announced the Memory Tagging Extension, or MTE [6], a part of the ARM v8.5 architecture. It does not yet exist in real hardware, but everything else about this extension is very promising.

The extension introduces a notion of two types of tags: address tags and memory tags.

An address tag is a 4-bit value stored at the top of every pointer in the process. MTE utilizes top-byte-ignore, an existing AArch64 feature that instructs the hardware to ignore the topmost byte of addresses, allowing this byte to be used as user-controlled metadata. Therefore MTE is applicable only to 64-bit software.

A memory tag is a 4-bit value associated with every aligned 16-byte region of application memory (memory granule). The way memory tags are stored is a hardware implementation detail. Logically, every 16 bytes of memory now contain an extra 4 bits of metadata in addition to 128 bits of data.

Every time a heap region is allocated, the software chooses a random 4-bit tag and marks both the address and all the newly allocated memory granules with this tag. The load and store instructions verify that the address tag matches the memory tag, causing a hardware exception on tag mismatch. MTE introduces new instructions to manipulate the tags.

Let’s look at the example in Figure 1. When the user code requests 20 bytes of heap to be allocated, operator new() rounds up the size to the 16-byte boundary (i.e., to 32), allocates a 32-byte chunk of memory (i.e., two 16-byte memory granules), chooses a random 4-bit tag (in this case, 0xA), puts this tag into the top-byte of the address, and updates the tags for the two newly allocated memory granules (the white-colored regions in the diagram). The adjacent memory regions have different memory tags (light gray granules have the tag 0x7, dark gray granules have the tag 0xE), so when the code tries to access memory at offset 32 from the pointer, MTE raises an exception because the tag of the pointer does not match the tag of the memory granule being accessed.

Figure 2 demonstrates an example of how heap-use-after-free is detected. On deallocation, operator delete() changes the tag of all three deallocated granules of memory from 0xD to 0x4, so that any access to this memory via an old (dangling) pointer causes an exception because the pointer still has the old tag 0xD. The adjacent memory regions (tagged with 0x9 and 0xB) are not affected by retagging of this region.

You may have noticed that bug detection with MTE is probabilistic. Indeed, there are only 16 possible values of a 4-bit tag. One random tag will be different from another random tag with a probability of 15/16 or ~93%. It is up to the software to decide whether to increase this probability with other tricks. For example, in order to detect contiguous buffer overflows with perfect accuracy, the allocator may enforce that tags for adjacent chunks are never equal.

With MTE, the heap memory is tagged inside malloc() and free(), and the tag checking is performed by the hardware. It means that recompilation will not be required for detecting heap-related bugs. MTE can also identify stack-use-after-return and buffer overflows on the stack or in global variables, but it will require recompilation with extra compiler options.

Comparison with AddressSanitizer
AddressSanitizer is a widely used tool for detecting memory safety issues. It uses compiler instrumentation to observe all loads and stores. Its specialized malloc “poisons” red zones around heap objects to detect buffer overflows and keeps freed memory in quarantine to detect use-after-free. The red zones and the quarantine are the major causes of AddressSanitizer’s high memory overhead.

MTE is conceptually similar to AddressSanitizer: both detect bugs at runtime, both require special functionality in malloc and free, and both require some amount of compiler support.

However, the use of address tags makes MTE sufficiently different: it does not require red zones or quarantine to detect bugs. This allows MTE to consume less memory. Moreover, MTE performs checking in hardware, thus eliminating the overhead of compiler instrumentation for every load and store.

Compared to AddressSanitizer, MTE brings the following benefits:
- � MTE checking can be turned on and off at runtime.
- � CPU overhead is expected to be very small, hopefully a small single-digit percentage, while AddressSanitizer typically has 2x–3x slowdown.
- � MTE can find heap-related bugs without recompilation.
- � Due to the small overhead, the same binary can be used for testing and for production.
- � MTE’s memory overhead is 3%–5%, compared to 2x–3x for AddressSanitizer.
- � Memory accesses that happen far from the object bounds or long after the object lifetime are more likely to be spotted by MTE than AddressSanitizer, which makes MTE a better exploit mitigation.
The only downside of MTE is that it may fail to detect buffer overflows that happen within the 16-byte granule:

char *array = new char [13]; // allocates one 16-byte granule

array[14] = 0; // access within the same 16-byte granule

Various software strategies are possible to improve bug detection for such cases with additional cost or complexity.

Uses of MTE
We envision several different usage modes for MTE.

First, MTE is going to be a much nicer version of AddressSanitizer for testing and fuzzing. It will find more bugs at a fraction of the cost. In many cases it will allow testing using the same binary as shipped to production.

Second, MTE could be used as a mechanism for testing in production (e.g., crowdsourced bug detection), always-on or enabled randomly. For client software, such as web browsers, it means that when a bug happens on a user device it will be detected, and, with user consent, an actionable bug report will be sent to the vendor. For server-side software it means that even the rarest bugs will be detected immediately once they get triggered.

Finally, MTE can be seen as a strong security mitigation. It is true that it prevents exploitation with less than 100% probability, but the probability is still very high, and the first failed exploitation attempt will warn the user and the software vendor. We believe that memory tagging will detect the most common classes of memory safety bugs in the wild, helping vendors identify and fix them and discouraging malicious actors from exploiting them.

Other clever ways to use MTE will likely be discovered. MTE may allow building debuggers with infinite hardware watchpoints, efficient race detectors, or faster garbage collectors.

HWASAN
The full potential of memory tagging will only be available with future hardware, several years from now. But you can reap some of the benefits now, like significantly reduced memory consumption, by using a software implementation of memory tagging: HWASAN (hardware-assisted AddressSanitizer) [7]. HWASAN is similar in spirit to AddressSanitizer, but its smaller memory footprint makes it a better choice on memory-restricted devices, such as mobile phones. Today, the tool only supports 64-bit ARM CPUs, since it requires the top-byte-ignore feature and a small modification in the kernel to allow passing tagged addresses to system calls.

Compatibility
MTE and HWASAN offer a high level of compatibility with existing code bases. We build the Android platform and the Chromium browser with HWASAN with few source code changes.

However, we have observed several cases of incompatibility.
In one such case, pointers to a particular type had application-specific metadata stored in the top 16 address bits. In another case, a pointer was cast to double and then back, losing the lower address bits. In one more case, the code computed difference between the addresses of local variables from different stack frames as a way to measure recursion depth. All these cases were easy to fix.

Related Work
With this article I hope to increase the awareness of the concept of memory tagging, as well as ARM’s fantastic Memory Tagging Extension, so that other CPU vendors adopt it sooner rather than later. Unlike most other existing hardware security extensions, ARM MTE directly addresses the memory safety bugs, that is, the root cause of many vulnerabilities, not just how attackers happen to exploit their consequences today. Beyond its effectiveness as a mitigation, MTE also serves as an effective bug detection tool that can be deployed in the wild. But even MTE is not a panacea for all classes of memory safety bugs.

Intra-Object-Buffer-Overflow
There are other classes of C/C++ bugs waiting to be dealt with. One such bug class is called intra-object-buffer-overflow.

struct S {

int array[5];

int another_field;

};

int GetInt(int *p, size_t idx) {

return p[idx];

}

int Foo(S *s) {

return GetInt(s->array, 5);

}

Here, by accessing an array out of bounds we end up reading another field in the same struct. In this case, AddressSanitizer, HWASAN, or MTE will not find the bug because the access happens within the same heap- (or stack-) allocated object. The Undefined Behavior Sanitizer (UBSan) can detect some simper cases, but not the more complex ones like this one because the function GetInt() that accesses the memory has lost the static bound information available in Foo(). There were multiple attempts to solve this problem (including at least one hardware extension, Intel MPX), but none were practical enough to be widely used.

A potential solution would combine dynamic bounds checking, static analysis (proving that either the code is correct or that dynamic checks are effective), and the banning of certain language constructs (like passing sub-objects without their bound information to unknown functions). For modern C++ code, perhaps the best solution is to replace arrays inside structs or classes with std::array and rely on the runtime for bounds checking.

Type-Confusion
Another bug class not directly addressed by MTE is type-confusion.

struct Image {

int pixels[100];

};

struct Secret {

int sensitive_data[200];

};

Secret *secret = new Secret;

...

DrawOnScreen((Image*) secret);

This code performs a cast between incompatible types; the following memory accesses in DrawOnScreen() will mistakenly access sensitive data without violating object bounds or lifetimes.

A potential solution is to use a stricter subset of C++ that disallows some invalid casts statically (via compile-time errors) and some other invalid casts dynamically (using a mechanism such as implemented in LLVM CFI).

Uninitialized Memory
A side effect of MTE is that whenever a memory allocation is tagged, it can also be initialized at no extra cost. The new ARM instructions can store memory tags and initialize the memory itself at the same time. Therefore, enabling MTE for an application’s heap and stack will mitigate most vulnerabilities from another class, uses of uninitialized memory.

However, we do not have to wait for MTE to eradicate this class of bugs. For example, Clang/LLVM 9.0 will have an option [8] to automatically initialize all stack variables.

Safer Languages
No discussion of memory safety in C and C++ can ignore the existence of “safe languages.” Java, Go, Swift, and Rust, among others, are indeed much safer, and in many cases they are a better choice for developing new software.

But none of them are really safe. Go and Swift have data races, Java’s huge runtime is itself written in C++, and only Rust comes close to being safe, at a cost of a (subjectively) steeper learning curve.

All of these languages, of course, have the “unsafe” escape hatch. Whenever the unsafe section is used, it turns the language into C, but just slightly worse, because fewer tools, practices, and habits are available for that language to avoid memory safety bugs. Here, again, Rust is probably the best with its support for AddressSanitizer and fuzzing. MTE will be useful for Rust and any other memory-safe language with “unsafe” code.

Besides, the billions of lines of C and C++ code are not going away any time soon.

GWP-ASan
GWP-ASan [9] is another bug detection tool that finds heap-use-after-free and heap-buffer-overflows. It relies on protected guard pages, the old trick used in the Electric Fence Malloc and similar tools. But there is a twist: guarded allocations are sampled. This means that the overhead, and the bug detection probability, can be scaled to be arbitrarily small. The small probability of bug detection can be improved by deploying the tool at large scale in production. We are beginning to detect bugs this way in the Google Chrome browser and other software.

GWP-ASan is not a replacement for AddressSanitizer or HWASAN since it handles a smaller subset of bugs and has very low detection probability, but it finds bugs that evade testing and only manifest in production. In the most performance-critical applications, where even 1% overhead is prohibitively expensive, we will be able to use MTE to implement sampled bug detection similar to GWP-ASan, but with a much lower cost and hence higher sampling and detection rate.

Conclusion
Once available in hardware, the ARM Memory Tagging Extension will reduce C and C++ memory unsafety from disastrous to tolerable. Hopefully, other hardware vendors will implement their variants of memory tagging. Before that happens, don’t forget to test your software with all available testing tools (e.g., AddressSanitizer or HWASAN) and fuzzers (e.g., libFuzzer),
and harden your binaries in production.

Acknowledgments
I want to thank my colleagues Vlad Tsyrklevich, Dmitry Vyukov, Alexander Potapenko, and Evgeniy Stepanov for helping me prepare this article.

References
[1] K. Serebryany, “Hardware Memory Tagging to Make C/C++ Memory Safe(r),” iSecCon’18: https://github.com/google
/sanitizers/blob/master/hwaddress-sanitizer/MTE-iSecCon
-2018.pdf.

[2] M. Miller, “Trends, Challenges, and Strategic Shifts in the Software Vulnerability Mitigation Landscape,” BlueHat 2019: https://www.youtube.com/watch?v=PjbGojjnBZQ.

[3] K. Serebryany, D. Bruening, A. Potapenko, D. Vyukov, “AddressSanitizer: A Fast Address Sanity Checker,” 2012 USENIX Advanced Technical Conference (USENIX ATC ’12): https://www.usenix.org/system/files/conference/atc12/atc12
-final39.pdf.

[4] K. Serebryany, “OSS-Fuzz—Google’s Continuous Fuzzing Service for Open Source Software,” 26th USENIX Security Symposium (USENIX Security ’17): https://www.usenix.org
/conference/usenixsecurity17/technical-sessions/presentation
/serebryany.

[5] K. Serebryany, E. Stepanov, A. Shlyapnikov, V. Tsyrklevich, D. Vyukov, “Memory Tagging and How It Improves C/C++ Memory Safety”: https://arxiv.org/pdf/1802.09517.pdf.

[6] Arm A-Profile Architecture Developments 2018: Armv8.5-A: https://community.arm.com/processors/b/blog/posts/arm
-a-profile-architecture-2018-developments-armv85a.

[7] HWASAN documentation: https://clang.llvm.org/docs/HardwareAssistedAddressSanitizerDesign.html.

[8] J. F. Bastien, “Automatic Variable Initialization”: https://
reviews.llvm.org/D54604.

[9] GWP-ASan for Chromium documentation: https://chromium
.googlesource.com/chromium/src/+/lkgr/docs/gwp_asan.md.