It's who you know: Graph mining using recursive structural features

Keith Henderson; Brian Gallagher; Lei Li; Leman Akoglu; Tina Eliassi-Rad; Hanghang Tong; Christos Faloutsos

doi:10.1145/2020408.2020512

It's who you know: Graph mining using recursive structural features

Keith Henderson, Brian Gallagher, Lei Li, Leman Akoglu, Tina Eliassi-Rad, Hanghang Tong, Christos Faloutsos

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

180 Scopus citations

Abstract

Given a graph, how can we extract good features for the nodes? For example, given two large graphs from the same domain, how can we use information in one to do classification in the other (i.e., perform across-network classification or transfer learning on graphs)? Also, if one of the graphs is anonymized, how can we use information in one to de-anonymize the other? The key step in all such graph mining tasks is to find effective node features. We propose ReFeX (Recursive Feature eXtraction), a novel algorithm, that recursively combines local (node-based) features with neighborhood (egonet-based) features; and outputs regional features - capturing "behavioral" information. We demonstrate how these powerful regional features can be used in within-network and across-network classification and de-anonymization tasks - without relying on homophily, or the availability of class labels. The contributions of our work are as follows: (a) ReFeX is scalable and (b) it is effective, capturing regional ("behavioral") information in large graphs. We report experiments on real graphs from various domains with over 1M edges, where ReFeX outperforms its competitors on typical graph mining tasks like network classification and de-anonymization.

Original language	English (US)
Title of host publication	Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11
Publisher	Association for Computing Machinery
Pages	663-671
Number of pages	9
ISBN (Print)	9781450308137
DOIs	https://doi.org/10.1145/2020408.2020512
State	Published - 2011
Externally published	Yes
Event	17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011 - San Diego, United States Duration: Aug 21 2011 → Aug 24 2011

Publication series

Name	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference	17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011
Country/Territory	United States
City	San Diego
Period	8/21/11 → 8/24/11

Keywords

Feature extraction
Graph mining
Identity resolution
Network classification

ASJC Scopus subject areas

Software
Information Systems

Access to Document

10.1145/2020408.2020512

Cite this

Henderson, K., Gallagher, B., Li, L., Akoglu, L., Eliassi-Rad, T., Tong, H., & Faloutsos, C. (2011). It's who you know: Graph mining using recursive structural features. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11 (pp. 663-671). (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). Association for Computing Machinery. https://doi.org/10.1145/2020408.2020512

It's who you know: Graph mining using recursive structural features. / Henderson, Keith; Gallagher, Brian; Li, Lei et al.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11. Association for Computing Machinery, 2011. p. 663-671 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Henderson, K, Gallagher, B, Li, L, Akoglu, L, Eliassi-Rad, T, Tong, H & Faloutsos, C 2011, It's who you know: Graph mining using recursive structural features. in Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, pp. 663-671, 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011, San Diego, United States, 8/21/11. https://doi.org/10.1145/2020408.2020512

Henderson K, Gallagher B, Li L, Akoglu L, Eliassi-Rad T, Tong H et al. It's who you know: Graph mining using recursive structural features. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11. Association for Computing Machinery. 2011. p. 663-671. (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). doi: 10.1145/2020408.2020512

Henderson, Keith ; Gallagher, Brian ; Li, Lei et al. / It's who you know : Graph mining using recursive structural features. Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11. Association for Computing Machinery, 2011. pp. 663-671 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).

@inproceedings{7e76f71bc7fb4c6a8401422b487f2926,

title = "It's who you know: Graph mining using recursive structural features",

abstract = "Given a graph, how can we extract good features for the nodes? For example, given two large graphs from the same domain, how can we use information in one to do classification in the other (i.e., perform across-network classification or transfer learning on graphs)? Also, if one of the graphs is anonymized, how can we use information in one to de-anonymize the other? The key step in all such graph mining tasks is to find effective node features. We propose ReFeX (Recursive Feature eXtraction), a novel algorithm, that recursively combines local (node-based) features with neighborhood (egonet-based) features; and outputs regional features - capturing {"}behavioral{"} information. We demonstrate how these powerful regional features can be used in within-network and across-network classification and de-anonymization tasks - without relying on homophily, or the availability of class labels. The contributions of our work are as follows: (a) ReFeX is scalable and (b) it is effective, capturing regional ({"}behavioral{"}) information in large graphs. We report experiments on real graphs from various domains with over 1M edges, where ReFeX outperforms its competitors on typical graph mining tasks like network classification and de-anonymization.",

keywords = "Feature extraction, Graph mining, Identity resolution, Network classification",

author = "Keith Henderson and Brian Gallagher and Lei Li and Leman Akoglu and Tina Eliassi-Rad and Hanghang Tong and Christos Faloutsos",

year = "2011",

doi = "10.1145/2020408.2020512",

language = "English (US)",

isbn = "9781450308137",

series = "Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",

publisher = "Association for Computing Machinery",

pages = "663--671",

booktitle = "Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11",

note = "17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011 ; Conference date: 21-08-2011 Through 24-08-2011",

}

TY - GEN

T1 - It's who you know

T2 - 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011

AU - Henderson, Keith

AU - Gallagher, Brian

AU - Li, Lei

AU - Akoglu, Leman

AU - Eliassi-Rad, Tina

AU - Tong, Hanghang

AU - Faloutsos, Christos

PY - 2011

Y1 - 2011

N2 - Given a graph, how can we extract good features for the nodes? For example, given two large graphs from the same domain, how can we use information in one to do classification in the other (i.e., perform across-network classification or transfer learning on graphs)? Also, if one of the graphs is anonymized, how can we use information in one to de-anonymize the other? The key step in all such graph mining tasks is to find effective node features. We propose ReFeX (Recursive Feature eXtraction), a novel algorithm, that recursively combines local (node-based) features with neighborhood (egonet-based) features; and outputs regional features - capturing "behavioral" information. We demonstrate how these powerful regional features can be used in within-network and across-network classification and de-anonymization tasks - without relying on homophily, or the availability of class labels. The contributions of our work are as follows: (a) ReFeX is scalable and (b) it is effective, capturing regional ("behavioral") information in large graphs. We report experiments on real graphs from various domains with over 1M edges, where ReFeX outperforms its competitors on typical graph mining tasks like network classification and de-anonymization.

AB - Given a graph, how can we extract good features for the nodes? For example, given two large graphs from the same domain, how can we use information in one to do classification in the other (i.e., perform across-network classification or transfer learning on graphs)? Also, if one of the graphs is anonymized, how can we use information in one to de-anonymize the other? The key step in all such graph mining tasks is to find effective node features. We propose ReFeX (Recursive Feature eXtraction), a novel algorithm, that recursively combines local (node-based) features with neighborhood (egonet-based) features; and outputs regional features - capturing "behavioral" information. We demonstrate how these powerful regional features can be used in within-network and across-network classification and de-anonymization tasks - without relying on homophily, or the availability of class labels. The contributions of our work are as follows: (a) ReFeX is scalable and (b) it is effective, capturing regional ("behavioral") information in large graphs. We report experiments on real graphs from various domains with over 1M edges, where ReFeX outperforms its competitors on typical graph mining tasks like network classification and de-anonymization.

KW - Feature extraction

KW - Graph mining

KW - Identity resolution

KW - Network classification

UR - http://www.scopus.com/inward/record.url?scp=80052674771&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80052674771&partnerID=8YFLogxK

U2 - 10.1145/2020408.2020512

DO - 10.1145/2020408.2020512

M3 - Conference contribution

AN - SCOPUS:80052674771

SN - 9781450308137

T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

SP - 663

EP - 671

BT - Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11

PB - Association for Computing Machinery

Y2 - 21 August 2011 through 24 August 2011

ER -

It's who you know: Graph mining using recursive structural features

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this