Galapagos: Model-driven discovery of end-to-end application-storage relationships in distributed systems

Kostas Magoutis, Murthy Devarakonda, Nikolai Joukov, Norbert G. Vogl

Research output: Contribution to journal, Article

26 Citations (Scopus)

Abstract

Modern business information systems are typically multi-tiered distributed systems comprising Web services, application services, databases, enterprise information systems, file systems, storage controllers, and other storage systems. In such environments, data is stored in different forms at multiple tiers, with each tier associated with some level of data abstraction. An information entity owned by an application generally maps to several data entities, logically associated across tiers and related to the application. Discovery of such relationships in a distributed system is a challenging problem, complicated by the widespread adoption of virtualization technologies and by the traditional tendency to manage each tier as an independent domain. In this paper, we present a system and methodology for model-driven discovery of end-to-end application-data relationships spanning multiple tiers, from the applications to the lowest levels of the storage hierarchy. The key to our methodology involves modeling how data is used and transformed by distributed software components. An important benefit of our system, which we call Galapagos, is the ability to reflect business decisions expressed at the application level to the level of storage.

Original language: English (US)
Pages (from-to): 367-377
Number of pages: 11
Journal: IBM Journal of Research and Development
Volume: 52
Issue number: 4-5
DOIs: 10.1147/rd.524.0367
State: Published - Nov 11 2008
Externally published: Yes

Fingerprint

Information systems
Industry
Web services
Data structures
Controllers
Virtualization

ASJC Scopus subject areas

  • Computer Science (all)

Cite this

Galapagos: Model-driven discovery of end-to-end application-storage relationships in distributed systems. / Magoutis, Kostas; Devarakonda, Murthy; Joukov, Nikolai; Vogl, Norbert G.

In: IBM Journal of Research and Development, Vol. 52, No. 4-5, 11.11.2008, p. 367-377.

@article{6c66be366ad44273b09633b278c2fa67,
title = "Galapagos: Model-driven discovery of end-to-end application-storage relationships in distributed systems",
abstract = "Modern business information systems are typically multi-tiered distributed systems comprising Web services, application services, databases, enterprise information systems, file systems, storage controllers, and other storage systems. In such environments, data is stored in different forms at multiple tiers, with each tier associated with some level of data abstraction. An information entity owned by an application generally maps to several data entities, logically associated across tiers and related to the application. Discovery of such relationships in a distributed system is a challenging problem, complicated by the widespread adoption of virtualization technologies and by the traditional tendency to manage each tier as an independent domain. In this paper, we present a system and methodology for model-driven discovery of end-to-end application-data relationships spanning multiple tiers, from the applications to the lowest levels of the storage hierarchy. The key to our methodology involves modeling how data is used and transformed by distributed software components. An important benefit of our system, which we call Galapagos, is the ability to reflect business decisions expressed at the application level to the level of storage.",
author = "Magoutis, Kostas and Devarakonda, Murthy and Joukov, Nikolai and Vogl, {Norbert G.}",
year = "2008",
month = "11",
day = "11",
doi = "10.1147/rd.524.0367",
language = "English (US)",
volume = "52",
pages = "367--377",
journal = "IBM Journal of Research and Development",
issn = "0018-8646",
publisher = "IBM Corporation",
number = "4-5",
}

TY - JOUR

T1 - Galapagos

T2 - Model-driven discovery of end-to-end application-storage relationships in distributed systems

AU - Magoutis, Kostas

AU - Devarakonda, Murthy

AU - Joukov, Nikolai

AU - Vogl, Norbert G.

PY - 2008/11/11

Y1 - 2008/11/11

AB - Modern business information systems are typically multi-tiered distributed systems comprising Web services, application services, databases, enterprise information systems, file systems, storage controllers, and other storage systems. In such environments, data is stored in different forms at multiple tiers, with each tier associated with some level of data abstraction. An information entity owned by an application generally maps to several data entities, logically associated across tiers and related to the application. Discovery of such relationships in a distributed system is a challenging problem, complicated by the widespread adoption of virtualization technologies and by the traditional tendency to manage each tier as an independent domain. In this paper, we present a system and methodology for model-driven discovery of end-to-end application-data relationships spanning multiple tiers, from the applications to the lowest levels of the storage hierarchy. The key to our methodology involves modeling how data is used and transformed by distributed software components. An important benefit of our system, which we call Galapagos, is the ability to reflect business decisions expressed at the application level to the level of storage.

UR - http://www.scopus.com/inward/record.url?scp=55449125392&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=55449125392&partnerID=8YFLogxK

U2 - 10.1147/rd.524.0367

DO - 10.1147/rd.524.0367

M3 - Article

AN - SCOPUS:55449125392

VL - 52

SP - 367

EP - 377

JO - IBM Journal of Research and Development

JF - IBM Journal of Research and Development

SN - 0018-8646

IS - 4-5

ER -