Abstract
A central theme in learning from image data is to develop appropriate image representations for the specific task at hand. Traditional methods combined handcrafted local features with high-level image representations to generate image-level representations. Thus, a practical challenge is to determine which features are appropriate for a specific task. For example, in the study of gene expression patterns in Drosophila melanogaster, texture features based on wavelets were particularly effective for determining the developmental stages from in situ hybridization (ISH) images. Such image representations are, however, not suitable for controlled vocabulary (CV) term annotation, because each CV term is often associated with only a part of an image. Here, we developed problem-independent feature extraction methods to generate hierarchical representations for ISH images. Our approach is based on deep convolutional neural networks (CNNs) that can act on image pixels directly. To make the extracted features generic, the models were trained on a natural image set with millions of labeled examples. These models were transferred to the ISH image domain and used directly as feature extractors to compute image representations. Furthermore, we employed a multi-task learning method to fine-tune the pre-trained models with labeled ISH images, and also extracted features from the fine-tuned models. Experimental results showed that feature representations computed by deep models based on transfer and multi-task learning significantly outperformed other methods for annotating gene expression patterns at different stage ranges. We also demonstrated that the intermediate layers of deep models produced the best gene expression pattern representations.
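The two ideas in the abstract, reading features from an intermediate layer of a pre-trained network and fine-tuning with one task per CV term, can be illustrated with a toy sketch. This is not the authors' model: the tiny dense layers, the weights in `LAYERS` and `HEADS`, and the summed binary cross-entropy loss are all assumptions chosen for illustration. The abstract does not state the exact architecture or loss.

```python
import math


def relu(v):
    return [max(0.0, x) for x in v]


def dense(weights, bias, v):
    # weights holds one row per output unit
    return [sum(w * x for w, x in zip(row, v)) + b
            for row, b in zip(weights, bias)]


def layer_activations(layers, pixels):
    """Run the shared layers and keep every intermediate activation."""
    activations = []
    h = pixels
    for weights, bias in layers:
        h = relu(dense(weights, bias, h))
        activations.append(h)
    return activations


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def multi_task_loss(features, heads, labels):
    """Sum binary cross-entropy over per-term heads that share `features`.

    Assumption: each CV term is treated as one binary task.
    """
    total = 0.0
    for (w, b), y in zip(heads, labels):
        p = sigmoid(sum(wi * xi for wi, xi in zip(w, features)) + b)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total


# Hypothetical stand-in for pre-trained layers: (weights, bias) per layer.
LAYERS = [
    ([[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]], [0.0, 0.1]),
    ([[1.0, -1.0], [0.5, 0.5]], [0.0, 0.0]),
]
# Hypothetical heads, one (weights, bias) pair per CV term.
HEADS = [([0.0, 0.0], 0.0), ([1.0, -1.0], 0.0)]

acts = layer_activations(LAYERS, [1.0, 0.5, 0.0])
features = acts[0]  # use an intermediate layer as the image representation
loss = multi_task_loss(features, HEADS, [1, 0])
```

In this sketch, transfer learning corresponds to reusing `LAYERS` unchanged, while fine-tuning would update them by minimizing `loss` on labeled ISH images. Choosing `acts[0]` rather than the last entry mirrors the reported finding that intermediate layers gave the best representations.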
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining |
Publisher | Association for Computing Machinery |
Pages | 1475-1484 |
Number of pages | 10 |
Volume | 2015-August |
ISBN (Print) | 9781450336642 |
State | Published - Aug 10 2015 |
Externally published | Yes |
Event | 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2015 - Sydney, Australia; Duration: Aug 10 2015 → Aug 13 2015 |
Keywords
- Bioinformatics
- Deep learning
- Image analysis
- Multi-task learning
- Transfer learning
ASJC Scopus subject areas
- Software
- Information Systems