As the size and scope of online data continues to grow, new machine learning techniques become necessary to best capitalize on the wealth of available information. However, the models that help convert data into knowledge require nontrivial processes to make sense of large collections of text and massive online graphs. In both scenarios, modern machine learning pipelines produce embeddings --- semantically rich vectors of latent features --- to convert human constructs for machine understanding. In this dissertation we focus on information available within biomedical science, including human-written abstracts of scientific papers, as well as machine-generated graphs of biomedical entity relationships. We present the Moliere system, and our ...
Graph structures, like syntax trees, social networks, and programs, are ubiquitous in many real worl...
Combining structured knowledge and neural language models to tackle natural language processing task...
Background Knowledge is often produced from data generated in scientific investigations. An ever-gro...
As the size and scope of online data continues to grow, new machine learning techniques become neces...
Hypothesis generation is becoming a crucial time-saving technique which allows biomedical researcher...
Recent advances in high throughput technologies have led to an increasing amount of rich and diverse...
Nowadays there is a tremendous amount of unstructured data, often represented by texts, which is cre...
In classical machine learning, hand-designed features are used for learning a mapping from raw data....
abstract: Unstructured texts containing biomedical information from sources such as electronic healt...
As science advances, the underlying literature grows rapidly providing valuable knowledge mines for ...
The continuous growth of scientific literature brings innovations and, at the same time, raises new ...
In modern biomedicine, the role of computation becomes more crucial in light of the ever-increasing ...
Medical research is expensive and risky. Drug manufacturers need to prioritize their early investmen...
Graphs are extensively employed in many systems due to their capability to capture the interactions ...
Information retrieval on graphs has applications in diverse areas, including analyzing social networ...
Graph structures, like syntax trees, social networks, and programs, are ubiquitous in many real worl...
Combining structured knowledge and neural language models to tackle natural language processing task...
Background Knowledge is often produced from data generated in scientific investigations. An ever-gro...
As the size and scope of online data continues to grow, new machine learning techniques become neces...
Hypothesis generation is becoming a crucial time-saving technique which allows biomedical researcher...
Recent advances in high throughput technologies have led to an increasing amount of rich and diverse...
Nowadays there is a tremendous amount of unstructured data, often represented by texts, which is cre...
In classical machine learning, hand-designed features are used for learning a mapping from raw data....
abstract: Unstructured texts containing biomedical information from sources such as electronic healt...
As science advances, the underlying literature grows rapidly providing valuable knowledge mines for ...
The continuous growth of scientific literature brings innovations and, at the same time, raises new ...
In modern biomedicine, the role of computation becomes more crucial in light of the ever-increasing ...
Medical research is expensive and risky. Drug manufacturers need to prioritize their early investmen...
Graphs are extensively employed in many systems due to their capability to capture the interactions ...
Information retrieval on graphs has applications in diverse areas, including analyzing social networ...
Graph structures, like syntax trees, social networks, and programs, are ubiquitous in many real worl...
Combining structured knowledge and neural language models to tackle natural language processing task...
Background Knowledge is often produced from data generated in scientific investigations. An ever-gro...