OSDI 2021 Accepted Papers

Authors may use this for content that may be of interest to some readers but is peripheral to the main technical contributions of the paper. Mothy joined the Computer Science Department at ETH Zurich in January 2007 and was named Fellow of the ACM in 2013 for contributions to operating systems and networking research. With her students, she has led research in AI, with a focus on robotics and machine learning, having concretely researched and developed a variety of autonomous robots, including teams of soccer robots, and mobile service robots. Petuum Awarded OSDI 2021 Best Paper for Goodput-Optimized Deep Learning. Welcome to the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21) submissions site. By monitoring the status of each job during training, Pollux models how its goodput (a novel metric we introduce that combines system throughput with statistical efficiency) would change by adding or removing resources. Pollux improves scheduling performance in deep learning (DL) clusters by adaptively co-optimizing inter-dependent factors both at the per-job level and at the cluster-wide level. A scientific paper consists of a constellation of artifacts that extend beyond the document itself: software, hardware, evaluation data and documentation, raw survey results, mechanized proofs, models, test suites, benchmarks, and so on. Zeph executes privacy-adhering data transformations in real-time and scales to thousands of data sources, allowing it to support large-scale low-latency data stream analytics. In the Ethereum network, decentralized Ethereum clients reach consensus through transitioning to the same blockchain states according to the Ethereum specification. Hence, CLP enables efficient search and analytics on archived logs, something that was impossible without it.
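The goodput metric described above for Pollux (system throughput combined with statistical efficiency) can be illustrated with a small sketch. This is not Pollux's model: the two curves, the function names, and every constant below are assumptions chosen only to show why maximizing the product can pick a different allocation than maximizing throughput alone.

```python
def throughput(gpus: int) -> float:
    """Hypothetical examples/sec: grows sublinearly with GPU count."""
    return 1000.0 * gpus ** 0.8


def statistical_efficiency(gpus: int) -> float:
    """Hypothetical training progress per example, in (0, 1].

    Assumes more GPUs means a larger batch, which makes each
    processed example contribute less to convergence.
    """
    return 1.0 / (1.0 + 0.5 * (gpus - 1))


def goodput(gpus: int) -> float:
    """Useful progress per second: throughput times efficiency."""
    return throughput(gpus) * statistical_efficiency(gpus)


def best_allocation(max_gpus: int) -> int:
    """Return the GPU count in 1..max_gpus with the highest goodput."""
    return max(range(1, max_gpus + 1), key=goodput)


if __name__ == "__main__":
    for g in (1, 2, 4, 8, 16):
        print(g, round(throughput(g)), round(goodput(g)))
    print("goodput-optimal GPUs:", best_allocation(16))
```

Under these made-up curves, raw throughput keeps rising with every added GPU while goodput peaks at a small allocation, which is the kind of trade-off a goodput-aware scheduler is meant to expose.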
The biennial ACM Symposium on Operating Systems Principles is the world's premier forum for researchers, developers, programmers, vendors and teachers of operating system technology. Perennial 2.0 makes this possible by introducing several techniques to formalize GoJournal's specification and to manage the complexity in the proof of GoJournal's implementation. Just using Lambdas on top of CPU servers offers up to 2.75× more performance-per-dollar than training only with CPU servers. Please identify yourself as a presenter and include your mailing address in your email. Questions? Reviews will be available for response on Wednesday, March 3, 2021. Consensus bugs are bugs that make Ethereum clients transition to incorrect blockchain states and fail to reach consensus with other clients. Finding the inductive invariant of the distributed protocol is a critical step in verifying the correctness of distributed systems, but takes a long time to do even for simple protocols. The biennial ACM Symposium on Operating Systems Principles is the world's premier forum for researchers, developers, programmers, and teachers of computer systems technology. Welcome to the 2021 USENIX Annual Technical Conference (ATC '21) submissions site! Tao Luo, Mingen Pan, Pierre Tholoniat, Asaf Cidon, and Roxana Geambasu, Columbia University; Mathias Lécuyer, Microsoft Research. Upon these two primitives, our system can scale to thousands of concurrent enclaves with high resource utilization and eliminate the high-cost initialization of secure memory using fork-style enclave creation without weakening the security guarantees. We present case studies and end-to-end applications that show how Storm lets developers specify diverse policies while centralizing the trusted code to under 1% of the application, and statically enforces security with modest type annotation overhead and no run-time cost.
With the help of thousands of Lambda threads, Dorylus scales GNN training to billion-edge graphs. The hybrid segment recycling chooses a proper block reclaiming policy between segment compaction and threaded logging based on their costs. See the Preview Session page for an overview of the topics covered in the program. Ethereum is the second-largest blockchain platform next to Bitcoin. J.P. Morgan AI Research partners with applied data analytics teams across the firm as well as with leading academic institutions globally. Memory allocation represents significant compute cost at the warehouse scale and its optimization can yield considerable cost savings. She developed the technology for making network routing self-stabilizing, largely self-managing, and scalable. USENIX Security '21 has three submission deadlines. For general conference information, see https://www.usenix.org/conference/osdi22. We build Polyjuice based on our learning framework and evaluate it against several existing algorithms. Papers must be in PDF format and must be submitted via the submission form. If in doubt about whether your submission to OSDI 2021 and your upcoming submission to SOSP are the same paper or not, please contact the PC chairs by email. Main conference program: 5-8 April 2022. Starting with small invariant formulas and strongest possible invariants avoids large SMT queries, improving SMT solver performance. We will look at various problems and approaches, and for each, see if blockchain would help. Nico Lehmann and Rose Kunkel, UC San Diego; Jordan Brown, Independent; Jean Yang, Akita Software; Niki Vazou, IMDEA Software Institute; Nadia Polikarpova, Deian Stefan, and Ranjit Jhala, UC San Diego. The key insight guiding our design is computation separation. This paper describes the design, implementation, and evaluation of Addra, the first system for voice communication that hides metadata over fully untrusted infrastructure and scales to tens of thousands of users. 
Call for Papers. Machine learning (ML) models trained on personal data have been shown to leak information about users. Zeph enforces privacy policies cryptographically and ensures that data available to third-party applications complies with users' privacy policies. The device then "calibrates" its interrupts to completions of latency-sensitive requests. Despite having the same end goals as traditional ML, FL executions differ significantly in scale, spanning thousands to millions of participating devices. We present the results of a 1% experiment at fleet scale as well as the longitudinal rollout in Google's warehouse-scale computers. Collaboration: You have a collaboration on a project, publication, grant proposal, program co-chairship, or editorship within the past two years (December 2018 through March 2021). There is no explicit limit to the response, but authors are strongly encouraged to keep it under 500 words; reviewers are neither required nor expected to read excessively long responses. We present selective profiling, a technique that locates data locality problems with overhead low enough to be suitable for production use. How can we design systems that will be reliable despite misbehaving participants? This budget is a scarce resource that must be carefully managed to maximize the number of successfully trained models. Authors must make a good faith effort to anonymize their submissions, and they should not identify themselves or their institutions either explicitly or by implication (e.g., through the references or acknowledgments). Second, it innovates on the underlying cryptographic machinery and constructs a new private information retrieval scheme, FastPIR, that reduces the time to process oblivious access requests for mailboxes. Sponsored by USENIX in cooperation with ACM SIGOPS.
When uploading your OSDI 2021 reviews for your submission to SOSP, you can optionally append a note about how you addressed the reviews and comments. Her specialties include network routing protocols and network security. We introduce a hybrid cryptographic protocol for privacy-adhering transformations of encrypted data. Camera-ready submission (all accepted papers): 15 March 2022. After three years working on web-based collaboration systems at a startup in North Carolina, he joined Sprint's Advanced Technology Lab in Burlingame, California, in 1998, working on cloud computing and network monitoring. When registering your abstract, you must provide information about conflicts with PC members. Currently, for large graphs, CPU servers offer the best performance-per-dollar over GPU servers. This year, there were only 2 accepted papers from UK institutes. You must not improperly identify a PC member as a conflict if none of these three circumstances applies, even if for some other reason you want to avoid them reviewing your paper. Erhu Feng, Xu Lu, Dong Du, Bicheng Yang, and Xueqiang Jiang, Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University; Engineering Research Center for Domain-specific Operating Systems, Ministry of Education, China; Yubin Xia, Binyu Zang, and Haibo Chen, Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University; Shanghai AI Laboratory; Engineering Research Center for Domain-specific Operating Systems, Ministry of Education, China. Academic and industrial participants present research and experience papers that cover the full range of theory and practice of computer . We implement DeSearch for two existing decentralized services that handle over 80 million records and 240 GB of data, and show that DeSearch can scale horizontally with the number of workers and can process 128 million search queries per day. This is the first OSDI in an odd year as OSDI moves to a yearly cadence.
Concretely, Dorylus is 1.22× faster and 4.83× cheaper than GPU servers for massive sparse graphs. In particular, responses must not include new experiments or data, describe additional work completed since submission, or promise additional work to follow. If you submit a paper to either of those venues, you may not also submit it to OSDI '21. Additionally, there is no assurance that data processing and handling comply with the claimed privacy policies. Marius is open-sourced at www.marius-project.org. A.H. Hunter, Jane Street Capital; Chris Kennelly, Paul Turner, Darryl Gove, Tipp Moseley, and Parthasarathy Ranganathan, Google. Taking place in Carlsbad, CA from 11-13 July, OSDI is a highly selective flagship conference in computer science, especially on the topic of computer systems. The 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21) will take place as a virtual event on July 14-16, 2021. Because DistAI starts with the strongest possible invariants, if the SMT solver fails, DistAI does not need to discard failed invariants, but knows to monotonically weaken them and try again with the solver, repeating the process until it eventually succeeds. The novel aspect of the nanoPU is the design of a fast path between the network and applications, bypassing the cache and memory hierarchy, and placing arriving messages directly into the CPU register file. We describe Fluffy, a multi-transaction differential fuzzer for finding consensus bugs in Ethereum. If you have any questions about conflicts, please contact the program co-chairs. Prepublication versions of the accepted papers from the summer submission deadline are available below. The overhead of GPT is 5% for memory-intensive workloads (e.g., Redis) and negligible for CPU-intensive workloads (e.g., RV8 and Coremarks).
We discuss the design and implementation of TEMERAIRE including strategies for hugepage-aware memory layouts to maximize hugepage coverage and to minimize fragmentation overheads. Papers so short as to be considered extended abstracts will not receive full consideration. Simultaneous submission of the same work to multiple venues, submission of previously published work, or plagiarism constitutes dishonesty or fraud. DeSearch then introduces a witness mechanism to make sure the completed tasks can be reused across different pipelines, and to make the final search results verifiable by end users. Evaluations show that Vegito can perform 1.9 million TPC-C NewOrder transactions and 24 TPC-H-equivalent queries per second simultaneously, retaining the excellent performance of specialized OLTP and OLAP counterparts (e.g., DrTM+H and MonetDB). First, it enables a caller to push a message to a callee in two hops, using a new way of assigning mailboxes to users that resembles how a post office assigns PO boxes to its customers. Commonly used log archival and compression tools like Gzip provide a high compression ratio, yet searching archived logs is a slow and painful process as it first requires decompressing the logs. This paper presents Zeph, a system that enables users to set privacy preferences on how their data can be shared and processed. We demonstrate that KEVIN reduces the amount of I/O traffic between the host and the device, and remains particularly robust as the system ages and the data become fragmented. Contact your program co-chairs, osdi21chairs@usenix.org, or the USENIX office, submissionspolicy@usenix.org. All deadline times are 23:59 hrs UTC. The program co-chairs will use this information at their discretion to preserve the anonymity of the review process without jeopardizing the outcome of the current OSDI submission.
Professor Veloso has been recognized with multiple honors, including being a Fellow of the ACM, IEEE, AAAS, and AAAI. Fortunately, we observe that the backups for high availability in modern distributed OLTP systems can be retrofitted to bridge the analytical queries and transactions in HTAP workloads. DMon speeds up PostgreSQL, one of the most popular database systems, by 6.64% on average (up to 17.48%). The abstractions we design for the privacy resource mirror those defined by Kubernetes for traditional resources, but there are also major differences. If the conference registration fee will pose a hardship for the presenter of the accepted paper, please contact conference@usenix.org. Her robot soccer teams have been RoboCup world champions several times, and the CoBot mobile robots have autonomously navigated for more than 1,000 km in university buildings. Although SSDs can be simplified under the current ZNS interface, its counterpart LFS must bear segment compaction overhead. We present Nap, a black-box approach that converts concurrent persistent memory (PM) indexes into NUMA-aware counterparts. Our evaluation on the SPEC benchmarks shows that SanRazor can reduce the overhead of sanitizers significantly, from 73.8% to 28.0–62.0% for AddressSanitizer, and from 160.1% to 36.6–124.4% for UndefinedBehaviorSanitizer (depending on the applied reduction scheme). Paper Submission Information: All submissions must be received by 11:59 PM AoE (UTC-12) on the day of the corresponding deadline. We observe that scalability challenges in training GNNs are fundamentally different from those in training classical deep neural networks and distributed graph processing, and that commonly used techniques, such as intelligent partitioning of the graph, do not yield desired results.
The papers will be available online to everyone beginning on the first day of the conference, July 14, 2021. To enable FL developers to interpret their results in model testing, Oort enforces their requirements on the distribution of participant data while improving the duration of federated testing by cherry-picking clients. Moreover, to handle dynamic workloads, Nap adopts a fast NAL switch mechanism. The wire-to-wire RPC response time through the nanoPU is just 69ns, an order of magnitude quicker than the best-of-breed, low latency, commercial NICs. We prove that DistAI is guaranteed to find the ∃-free inductive invariant that proves the desired safety properties in finite time, if one exists. Web pages today commonly include large amounts of JavaScript code in order to offer users a dynamic experience. USENIX discourages program co-chairs from submitting papers to the conferences they organize, although they are allowed to do so. The 20th ACM Workshop on Hot Topics in Networks (HotNets 2021) will bring together researchers in computer networks and systems to engage in a lively debate on the theory and practice of computer networking. Shaghayegh Mardani, UCLA; Ayush Goel, University of Michigan; Ronny Ko, Harvard University; Harsha V. Madhyastha, University of Michigan; Ravi Netravali, Princeton University. Used Zotero to organize papers about the stress and diffusion between anode and electrolyte and made a summary. To resolve the problem, we propose a new LFS-aware ZNS interface, called ZNS+, and its implementation, where the host can offload data copy operations to the SSD to accelerate segment compaction. In this paper, we propose Oort to improve the performance of federated training and testing with guided participant selection. Session Chairs: Gennady Pekhimenko, University of Toronto / Vector Institute, and Shivaram Venkataraman, University of Wisconsin–Madison. Aurick Qiao, Petuum, Inc.
and Carnegie Mellon University; Sang Keun Choe and Suhas Jayaram Subramanya, Carnegie Mellon University; Willie Neiswanger, Petuum, Inc. and Carnegie Mellon University; Qirong Ho, Petuum, Inc.; Hao Zhang, Petuum, Inc. and UC Berkeley; Gregory R. Ganger, Carnegie Mellon University; Eric P. Xing, MBZUAI, Petuum, Inc., and Carnegie Mellon University. When further combined with a simple caching strategy, our evaluation shows that P3 is able to outperform existing state-of-the-art distributed GNN frameworks by up to 7×. Horcrux-compliant web servers perform offline analysis of all the JavaScript code on any frame they serve to conservatively identify, for every JavaScript function, the union of the page state that the function could access across all loads of that page. Authors are required to register abstracts by 3:00 p.m. PST on December 3, 2020, and to submit full papers by 3:00 p.m. PST on December 10, 2020. The key insight in blk-switch is that Linux's multi-queue storage design, along with multi-queue network and storage hardware, makes the storage stack conceptually similar to a network switch. However, your OSDI submission must use an anonymized name for your project or system that differs from any used in such contexts. Writing a correct operating system kernel is notoriously hard. Metadata from voice calls, such as the knowledge of who is communicating with whom, contains rich information about people's lives. Copyright to the individual works is retained by the author[s]. In experiments with real DL jobs and with trace-driven simulations, Pollux reduces average job completion times by 37-50% relative to state-of-the-art DL schedulers, even when they are provided with ideal resource and training configurations for every job. We demonstrate the above using design, implementation and evaluation of blk-switch, a new Linux kernel storage stack architecture.
Acknowledgements: Paper prepared for the post-conference workshop on Food for Thought: Economic Analysis in Anticipation of the Next Farm Bill at the Agricultural and Applied Economics Association annual meeting, Austin, TX. Submissions may include as many additional pages as needed for references but not for appendices. The main contribution of this paper is GoJournal, a verified, concurrent journaling system that provides atomicity for storage applications, together with Perennial 2.0, a framework for formally specifying and verifying concurrent crash-safe systems. Our approach effectively eliminates high communication and partitioning overheads, and couples it with a new pipelined push-pull parallelism based execution strategy for fast model training. Dorylus is up to 3.8× faster and 10.7× cheaper compared to existing sampling-based systems. Attaching supplementary material is optional; if your paper says that you have source code or formal proofs, you need not attach them to convince the PC of their existence. For example, traditional compute resources are replenishable while privacy is not: a CPU can be regained after a model finishes execution while privacy budget cannot. As a member of ACCT, I have served two years on the bylaws and governance committee and two years on the finance and audit committee. We convert five state-of-the-art PM indexes using Nap. With an aim to improve time-to-accuracy performance in model training, Oort prioritizes the use of those clients who have both data that offers the greatest utility in improving model accuracy and the capability to run training quickly. Swapnil Gandhi and Anand Padmanabha Iyer, Microsoft Research. blk-switch evaluation over a variety of scenarios shows that it consistently achieves μs-scale average and tail latency (at both 99th and 99.9th percentiles), while allowing applications to near-perfectly utilize the hardware capacity.
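The contrast drawn above between replenishable compute and a non-replenishable privacy budget can be made concrete with a small sketch. The class and method names are hypothetical and are not taken from any system named on this page; the only point illustrated is that a CPU pool has a release operation while a privacy budget deliberately has none.

```python
class CpuPool:
    """Replenishable resource: cores return to the pool after a job ends."""

    def __init__(self, cores: int) -> None:
        self.free = cores

    def acquire(self, n: int) -> bool:
        if n > self.free:
            return False
        self.free -= n
        return True

    def release(self, n: int) -> None:
        self.free += n


class PrivacyBudget:
    """Non-replenishable resource: spent budget is never given back."""

    def __init__(self, epsilon: float) -> None:
        self.remaining = epsilon

    def consume(self, epsilon: float) -> bool:
        # Refuse the request rather than overspend the budget.
        if epsilon > self.remaining:
            return False
        self.remaining -= epsilon
        return True
    # Intentionally no release(): finishing a training job
    # does not restore the privacy it consumed.


if __name__ == "__main__":
    pool = CpuPool(8)
    pool.acquire(4)
    pool.release(4)                  # job finished, cores come back
    budget = PrivacyBudget(1.0)
    print(budget.consume(0.5))       # request fits in the budget
    print(budget.consume(0.75))      # request exceeds what is left
    print(pool.free, budget.remaining)
```

A scheduler built on these two abstractions would treat a rejected `consume` as permanent for that data, whereas a rejected `acquire` only means waiting for another job to finish.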
Based on the observation that invariants are often concise in practice, DistAI starts with small invariant formulas and enumerates all strongest possible invariants that hold for all samples. Submission of a response is optional. For instance, FAST '21 and NSDI '21 have author-notification dates after the OSDI '21 abstract-registration deadline. In this paper, we present P3, a system that focuses on scaling GNN model training to large real-world graphs in a distributed setting. Hence, kernel developers are constantly refining synchronization within OS kernels to improve scalability at the risk of introducing subtle bugs. Message from the Program Co-Chairs. Our approach outperforms existing file systems on a block SSD by a wide margin, 6.2× on average, for metadata-intensive benchmarks. PET discovers and applies program transformations that improve computation efficiency but only maintain partial functional equivalence. Professor Veloso is on leave from Carnegie Mellon University as the Herbert A. Simon University Professor in the School of Computer Science, and the past Head of the Machine Learning Department. Professor Veloso is the Past President of AAAI (the Association for the Advancement of Artificial Intelligence), and the co-founder, Trustee, and Past President of RoboCup. Secure hardware enclaves have been widely used for protecting security-critical applications in the cloud. To remedy this, we introduce DeSearch, the first decentralized search engine that guarantees the integrity and privacy of search results for decentralized services and blockchain apps. We propose a new framework for computing the embeddings of large-scale graphs on a single machine. Authors may submit a response to those reviews until Friday, March 5, 2021. Conference Dates: Apr 12, 2021 - Apr 14, 2021. Paper abstracts and proceedings front matter are available to everyone now.
We develop MAGE, an execution engine for SC that efficiently runs SC computations that do not fit in memory. Advisor: You have a past or present association as thesis advisor or advisee. We propose PET, the first DNN framework that optimizes tensor programs with partially equivalent transformations and automated corrections. HotNets provides a venue for discussing innovative ideas and for debating future research agendas in networking. Based on the observation that real-world workloads always feature skewed access patterns, Nap introduces a NUMA-aware layer (NAL) on the top of existing concurrent PM indexes, and steers accesses to hot items to this layer. Four months after we reported the bugs to Geth developers, one of the bugs was triggered on the mainnet, and caused nodes using a stale version of Geth to hard fork the Ethereum blockchain. Propose an interesting, compelling solution; demonstrate the practicality and benefits of the solution; clearly describe the paper's contributions; clearly articulate the advances beyond previous work. MAGE outperforms the OS virtual memory system by up to an order of magnitude, and in many cases, runs SC computations that do not fit in memory at nearly the same speed as if the underlying machines had unbounded physical memory to fit the entire computation. For general conference information, see https://www . VLDB 2021 - 47th International Conference on Very Large Data Bases. We demonstrate that Marius achieves the same level of accuracy but is up to one order of magnitude faster.
Compared to a state-of-the-art fuzzer, Fluffy improves the fuzzing throughput by 510× and the code coverage by 2.7× with various optimizations: in-process fuzzing, fuzzing harnesses for Ethereum clients, and semantic-aware mutation that reduces erroneous test cases. As a result, data characteristics and device capabilities vary widely across clients. VLDB 2021 venue: Tivoli Hotel & Congress Center, Arni Magnussons Gade 2, 1577 Copenhagen, Denmark, +45 3268 4300. In-person attendees can purchase tickets for the park / gardens with a 15% discount, which is a special offer by Tivoli Hotel & Congress Center to VLDB 2021 attendees. For realistic workloads, KEVIN improves throughput by 68% on average. They collectively make the backup fresh, columnar, and fault-tolerant, even facing millions of concurrent transactions per second. We present Storm, a web framework that allows developers to build MVC applications with compile-time enforcement of centrally specified data-dependent security policies. People often assume that blockchain has Byzantine robustness, so adding it to any system will make that system super robust against any calamity. Cores can safely and concurrently read from their local kernel replica, eliminating remote NUMA accesses. For conference information, see: . If your accepted paper should not be published prior to the event, please notify production@usenix.org. A graph embedding is a fixed length vector representation for each node (and/or edge-type) in a graph and has emerged as the de-facto approach to apply modern machine learning on graphs. This paper demonstrates that it is possible to achieve μs-scale latency using Linux kernel storage stack, even when tens of latency-sensitive applications compete for host resources with throughput-bound applications that perform read/write operations at throughput close to hardware capacity.
One important reason for the high cost is, as we observe in this paper, that many sanitizer checks are redundant: the same safety property is repeatedly checked, leading to unnecessarily wasted computing resources. Performance experiments show that GoNFS provides similar performance (e.g., at least 90% throughput across several benchmarks on an NVMe disk) to Linux's NFS server exporting an ext4 file system, suggesting that GoJournal is a competitive journaling system. SanRazor adopts a novel hybrid approach: it captures both dynamic code coverage and static data dependencies of checks, and uses the extracted information to perform a redundant check analysis. GoJournal is implemented in Go, and Perennial is implemented in the Coq proof assistant. While several new GNN architectures have been proposed, the scale of real-world graphs, in many cases billions of nodes and edges, poses challenges during model training. Log search and log archiving, despite being critical problems, are mutually exclusive. To help more profitably utilize sanitizers, we introduce SanRazor, a practical tool aiming to effectively detect and remove redundant sanitizer checks. She also has made contributions in network security, including scalable data expiration, distributed algorithms despite malicious participants, and DDOS prevention techniques. Therefore, developers typically find data locality issues via dynamic profiling and repair them manually. OSDI brings together professionals from academic and industrial backgrounds in what has become a premier forum for discussing the design, implementation, and implications of systems software. Existing algorithms are designed to work well for certain workloads. DistAI generates data by simulating the distributed protocol at different instance sizes and recording states as samples. We implemented the ZNS+ SSD on an SSD emulator and a real SSD. However, Addra improves message latency in this architecture, which is a key performance metric for voice calls.
Foreshadow was chosen as an IEEE Micro Top Pick. There are two major GNN training obstacles: 1) it relies on high-end servers with many GPUs which are expensive to purchase and maintain, and 2) limited memory on GPUs cannot scale to today's billion-edge graphs.

