Machine learning and distributed computing approaches for quantum chemistry-based data generation and molecular property prediction