pith. sign in

arxiv: 1809.06334 · v1 · pith:N5RB3X4Anew · submitted 2018-09-17 · ⚛️ physics.chem-ph · cs.LG· stat.ML

Powerful, transferable representations for molecules through intelligent task selection in deep multitask networks

classification ⚛️ physics.chem-ph cs.LGstat.ML
keywords tasklearningdeeprepresentationbiasdatalimitationsmethodology
0
0 comments X
read the original abstract

Chemical representations derived from deep learning are emerging as a powerful tool in areas such as drug discovery and materials innovation. Currently, this methodology has three major limitations - the cost of representation generation, risk of inherited bias, and the requirement for large amounts of data. We propose the use of multi-task learning in tandem with transfer learning to address these limitations directly. In order to avoid introducing unknown bias into multi-task learning through the task selection itself, we calculate task similarity through pairwise task affinity, and use this measure to programmatically select tasks. We test this methodology on several real-world data sets to demonstrate its potential for execution in complex and low-data environments. Finally, we utilise the task similarity to further probe the expressiveness of the learned representation through a comparison to a commonly used cheminformatics fingerprint, and show that the deep representation is able to capture more expressive task-based information.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.