Abstract—In today’s distributed information systems, a large amount of monitoring data such as log files have been collected. These monitoring data at various points of a distributed information system provide unparallel opportunities for us to characterize and track the information system via effectively correlating all monitoring data across the distributed system. [1] proposed a concept named flow intensity to measure the intensity with which the monitoring data reacts to the volume of different user requests. The AutoRegressive model with eXogenous inputs (ARX) was used to quantify the re-lationship between each pair of flow intensity measured at various points across distributed systems. If such relationships hold all the time, they ar...
We present a statistical probing-approach to distributed fault-detection in networked systems, based...
Abstract. A method for automated analysis of fault-tolerance of distributed systems is presented. It...
Diagnosing performance problems in modern datacenters and distributed systems is challenging, as the...
It is notoriously hard to develop dependable distributed systems. This is partly due to the difficul...
It is notoriously hard to develop dependable distributed systems. This is partly due to the difficul...
University of Minnesota Ph.D. dissertation. August 2020. Major: Chemical Engineering. Advisor: Prodr...
system performance, Bayesian networks, information retrieval, problem signatures We present a method...
Automated behavior analysis is a valuable technique in the development and maintainence of distribut...
Distributed systems are difficult to debug and understand. A key reason for this is distributed sta...
Large-scale dynamic systems, such as the Internet, as well as emerging peerto-peer networks and comp...
Abstract: This work considers the problem of obtaining optimal estimates via distributed computation...
Distributed systems have become pervasive in current society. From laptops and mobile phones, to ser...
This work considers the problem of obtaining optimal estimates via distributed computation in a larg...
Various methods have been proposed to identify emergent dynamical structures in complex systems. In ...
This paper addresses the problem of selection and discovery of a consistent availability monitoring ...
We present a statistical probing-approach to distributed fault-detection in networked systems, based...
Abstract. A method for automated analysis of fault-tolerance of distributed systems is presented. It...
Diagnosing performance problems in modern datacenters and distributed systems is challenging, as the...
It is notoriously hard to develop dependable distributed systems. This is partly due to the difficul...
It is notoriously hard to develop dependable distributed systems. This is partly due to the difficul...
University of Minnesota Ph.D. dissertation. August 2020. Major: Chemical Engineering. Advisor: Prodr...
system performance, Bayesian networks, information retrieval, problem signatures We present a method...
Automated behavior analysis is a valuable technique in the development and maintainence of distribut...
Distributed systems are difficult to debug and understand. A key reason for this is distributed sta...
Large-scale dynamic systems, such as the Internet, as well as emerging peerto-peer networks and comp...
Abstract: This work considers the problem of obtaining optimal estimates via distributed computation...
Distributed systems have become pervasive in current society. From laptops and mobile phones, to ser...
This work considers the problem of obtaining optimal estimates via distributed computation in a larg...
Various methods have been proposed to identify emergent dynamical structures in complex systems. In ...
This paper addresses the problem of selection and discovery of a consistent availability monitoring ...
We present a statistical probing-approach to distributed fault-detection in networked systems, based...
Abstract. A method for automated analysis of fault-tolerance of distributed systems is presented. It...
Diagnosing performance problems in modern datacenters and distributed systems is challenging, as the...