GNNs and LLMs: Advances in Graph Learning

The integration of Graph Neural Networks (GNNs) and Large Language Models (LLMs) is driving significant advancements in graph-based learning. A notable trend is the development of methods that leverage LLMs to enhance graph data preprocessing and feature extraction, enabling more effective cross-graph feature alignment and node classification. These innovations are particularly impactful in scenarios where textual data is scarce or non-existent, as they allow for the synthesis of text-attributed graphs from traditional graphs. Additionally, the use of LLMs in a cost-effective, label-free node classification framework is demonstrating superior performance over traditional methods, reducing the dependency on expensive labeled data. The synergy between GNNs and LLMs is also being explored to improve data quality and model generalization across diverse tasks and domains, with applications ranging from gene ontology annotation to career path prediction. Notably, the creation of unified datasets and benchmarks is further solidifying the foundation for these advancements, ensuring reliable and scalable solutions for real-world challenges.

Noteworthy Papers:

  • Topology-Aware Node description Synthesis (TANS): Demonstrates the potential of LLMs for preprocessing graph-structured data, even in the absence of textual information.
  • Cella: Introduces an active self-training framework that significantly improves label-free node classification performance while reducing query costs to LLMs.
  • UniEntrezDB: Pioneers the unification of gene ontology annotations, enhancing the reliability and applicability of LLMs in gene research.

Sources

Can LLMs Convert Graphs to Text-Attributed Graphs?

Cost-Effective Label-free Node Classification with LLMs

Graph Learning in the Era of LLMs: A Survey from the Perspective of Data, Models, and Tasks

UniEntrezDB: Large-scale Gene Ontology Annotation Dataset and Evaluation Benchmarks with Unified Entrez Gene Identifiers

KARRIEREWEGE: A Large Scale Career Path Prediction Dataset

Built with on top of