Follow
Edward Chuah
Edward Chuah
University of Aberdeen
Verified email at acm.org - Homepage
Title
Cited by
Cited by
Year
Diagnosing the root-causes of failures from cluster log files
E Chuah, S Kuo, P Hiew, WC Tjhi, G Lee, J Hammond, MT Michalewicz, ...
2010 International Conference on High Performance Computing, 1-10, 2010
662010
Linking resource usage anomalies with system failures from cluster log data
E Chuah, A Jhumka, S Narasimhamurthy, J Hammond, JC Browne, ...
2013 IEEE 32nd International Symposium on Reliable Distributed Systems, 111-120, 2013
632013
Crude: Combining resource usage data and error logs for accurate error detection in large-scale distributed systems
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS), 51-60, 2016
392016
Online failure prediction for hpc resources using decentralized clustering
A Pelaez, A Quiroz, JC Browne, E Chuah, M Parashar
2014 21st International Conference on High Performance Computing (HiPC), 1-9, 2014
282014
Towards detecting patterns in failure logs of large-scale distributed systems
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
2015 IEEE International Parallel and Distributed Processing Symposium …, 2015
242015
Establishing hypothesis for recurrent system failures from cluster log files
E Chuah, G Lee, WC Tjhi, SH Kuo, T Hung, J Hammond, T Minyard, ...
2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure …, 2011
172011
Towards comprehensive dependability-driven resource use and message log-analysis for HPC systems diagnosis
E Chuah, A Jhumka, S Alt, D Balouek-Thomert, JC Browne, M Parashar
Journal of Parallel and Distributed Computing 132, 95-112, 2019
132019
Insights into the diagnosis of system failures from cluster message logs
E Chuah, A Jhumka, JC Browne, B Barth, S Narasimhamurthy
2015 11th European Dependable Computing Conference (EDCC), 225-232, 2015
132015
Using message logs and resource use data for cluster failure diagnosis
E Chuah, A Jhumka, JC Browne, N Gurumdimma, S Narasimhamurthy, ...
2016 IEEE 23rd International Conference on High Performance Computing (HiPC …, 2016
122016
Sentiment analysis based error detection for large-scale systems
KA Alharthi, A Jhumka, S Di, F Cappello, E Chuah
2021 51st Annual IEEE/IFIP International Conference on Dependable Systems …, 2021
112021
Enabling dependability-driven resource use and message log-analysis for cluster system diagnosis
E Chuah, A Jhumka, S Alt, T Damoulas, N Gurumdimma, MC Sawley, ...
2017 IEEE 24th International Conference on High Performance Computing, Data …, 2017
112017
An optimal smooth QoS adaptation strategy for QoS differentiated scalable media streaming
X Li, E Chuah, JY Tham, KH Goh
2008 IEEE International Conference on Multimedia and Expo, 429-432, 2008
92008
Using resource use data and system logs for HPC system error propagation and recovery diagnosis
E Chuah, A Jhumka, S Alt, JJ Villalobos, J Fryman, W Barth, M Parashar
2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2019
82019
Towards increasing the error handling time window in large-scale distributed systems using console and resource usage logs
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
2015 IEEE Trustcom/BigDataSE/ISPA 3, 61-68, 2015
72015
Challenges in identifying network attacks using netflow data
E Chuah, N Suri, A Jhumka, S Alt
2021 IEEE 20th International Symposium on Network Computing and Applications …, 2021
32021
A survey of log-correlation tools for failure diagnosis and prediction in cluster systems
E Chuah, A Jhumka, M Malek, N Suri
IEEE Access 10, 133487-133503, 2022
22022
Failure diagnosis for cluster systems using partial correlations
E ChuahM, A Jhumka, S Alt, RT Evans, N Suri
2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2021
22021
Features correlation-based workflows for high-performance computing systems diagnosis
E Chuah
University of Warwick, 2020
22020
An empirical study of major page faults for failure diagnosis in cluster systems
E Chuah, A Jhumka, S Narasimhamurthy
The Journal of Supercomputing, 35, 2023
12023
On handling redundancy for failure log analysis of cluster systems
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
DEPEND 2015: The Eighth International Conference on Dependability, 2015
12015
The system can't perform the operation now. Try again later.
Articles 1–20