A common characteristic of relational data sets —degree disparity—can lead relational learning algorithms to discover misleading correlations. Degree disparity occurs when the frequency of a relation is correlated with the values of the target variable. In such cases, aggregation functions used by many relational learning algorithms will result in misleading correlations and added complexity in models. We examine this problem through a combination of simulations and experiments. We show how two novel hypothesis testing procedures can adjust for the effects of using aggregation functions in the presence of degree disparity. 1
Dependency networks approximate a joint probability distribution over multiple random variables as a...
In relational learning one learns patterns from relational databases, which usually contain multiple...
Due to interest in social and economic networks, relational modeling is attracting increasing attent...
A common characteristic of relational data sets ---degree disparity---can lead relational learning ...
Two common characteristics of relational data sets — concentrated linkage and relational auto-correl...
Many data sets routinely captured by organizations are relational in nature---from marketing and sal...
Abstract. Bias/variance analysis is a useful tool for investigating the performance of machine learn...
Abstract. Bias/variance analysis is a useful tool for investigating the performance of machine learn...
In relational learning, one learns patterns from rela-tional databases, which usually contain multip...
Many data sets routinely captured by organizations are relational in nature— from marketing and sale...
Dependency networks approximate a joint probability distribution over multiple random variables as a...
Autocorrelation, a common characteristic of many datasets, refers to correlation between values of t...
The primary difference between propositional (attribute-value) and relational data is the existence ...
Abstract. Multirelational classification algorithms search for patterns across multiple interlinked ...
Relational learning refers to learning from data that have a complex structure. This structure may ...
Dependency networks approximate a joint probability distribution over multiple random variables as a...
In relational learning one learns patterns from relational databases, which usually contain multiple...
Due to interest in social and economic networks, relational modeling is attracting increasing attent...
A common characteristic of relational data sets ---degree disparity---can lead relational learning ...
Two common characteristics of relational data sets — concentrated linkage and relational auto-correl...
Many data sets routinely captured by organizations are relational in nature---from marketing and sal...
Abstract. Bias/variance analysis is a useful tool for investigating the performance of machine learn...
Abstract. Bias/variance analysis is a useful tool for investigating the performance of machine learn...
In relational learning, one learns patterns from rela-tional databases, which usually contain multip...
Many data sets routinely captured by organizations are relational in nature— from marketing and sale...
Dependency networks approximate a joint probability distribution over multiple random variables as a...
Autocorrelation, a common characteristic of many datasets, refers to correlation between values of t...
The primary difference between propositional (attribute-value) and relational data is the existence ...
Abstract. Multirelational classification algorithms search for patterns across multiple interlinked ...
Relational learning refers to learning from data that have a complex structure. This structure may ...
Dependency networks approximate a joint probability distribution over multiple random variables as a...
In relational learning one learns patterns from relational databases, which usually contain multiple...
Due to interest in social and economic networks, relational modeling is attracting increasing attent...