JD Tech
Aug 14, 2018 · Artificial Intelligence
GCN‑LSTM Image Captioning Model by JD AI Research Institute
JD AI Research Institute presented a GCN‑LSTM encoder‑decoder system that integrates object semantic and spatial relationships via graph convolutional networks to significantly improve image captioning performance on the COCO benchmark, achieving state‑of‑the‑art results.
COCO datasetLSTMcomputer vision
0 likes · 7 min read