No data available.
Please log in to see this content.
You have no subscription access to this content.
No metrics data to plot.
The attempt to load metrics for this article has failed.
The attempt to plot a graph for these metrics has failed.
The full text of this article is not currently available.
Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science
1.Materials Genome Initiative for Global Competitiveness, OSTP, June 2011.
2.T. Hey, S. Tansley, and K. Tolle, The Fourth Paradigm: Data-Intensive Scientific Discovery (Microsoft Research, 2009).
3.Materials Genome Initiative Strategic Plan, National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative, June 2014.
4.C. H. Ward, J. A. Warren, and R. J. Hanisch, “Making materials science and engineering data more valuable research products,” Integr. Mater. Manuf. Innovation 3, 1–17 (2014).
9.A. Agrawal, P. D. Deshpande, A. Cecen, G. P. Basavarsu, A. N. Choudhary, and S. R. Kalidindi, “Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters,” Integr. Mater. Manuf. Innovation 3, 1–19 (2014).
10.B. Meredig, A. Agrawal, S. Kirklin, J. E. Saal, J. W. Doak, A. Thompson, K. Zhang, A. Choudhary, and C. Wolverton, “Combinatorial screening for new materials in unconstrained composition space with machine learning,” Phys. Rev. B 89, 1–7 (2014).
11.R. Liu, A. Kumar, Z. Chen, A. Agrawal, V. Sundararaghavan, and A. Choudhary, “A predictive machine learning approach for microstructure optimization and materials design,” Sci. Rep. 5, 11551 (2015).
12.H. George, “John and Pat Langley. Estimating continuous distributions in Bayesian classifiers,” in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (Morgan Kaufmann Publishers Inc., 1995), pp. 338–345.
13.R. R. Bouckaert, “Naive Bayes classifiers that perform well with continuous variables,” in AI 2004: Advances in Artificial Intelligence (Springer, 2004), pp. 1089–1094.
14.D. Hosmer and S. Lemeshow, Applied Logistic Regression (John Wiley and Sons, Inc., 1989).
15.E. Weher, “Edwards, Allen, L.: An introduction to linear regression and correlation. (A series of books in psychology.) W. H. Freeman and Comp., San Francisco 1976. 213 S., Tafelanh., s 7.00,” Biom. J. 19, 83–84 (1977).
17.C. Bishop, Neural Networks for Pattern Recognition (Oxford University Press, 1995).
18.L. Fausett, Fundamentals of Neural Networks (Prentice Hall, New York, 1994).
19.V. N. Vapnik, The Nature of Statistical Learning Theory (Springer, 1995).
20.R. Kohavi, “The power of decision tables,” in Proceedings of the 8th European Conference on Machine Learning, ECML ’95 (Springer-Verlag, London, UK, 1995), pp. 174–189.
21.I. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann Publication, 2005).
22.J. Quinlan, C4. 5: Programs for Machine Learning (Morgan Kaufmann, 1993).
23.Y. Freund and L. Mason, “The alternating decision tree learning algorithm,” in Proceeding of the Sixteenth International Conference on Machine Learning (Citeseer, 1999), pp. 124–133.
25.M. Sumner, E. Frank, and M. Hall, “Speeding up logistic model tree induction,” in Knowledge Discovery in Databases: PKDD 2005 (Springer, 2005), pp. 675–683.
26.Y. Wang and I. Witten, “Induction of model trees for predicting continuous classes,” inProceedings of European Conference on Machine Learning Poster Papers, Prague, Czech Republic (Springer, 1997), pp. 128–137.
27.J. R. Quinlan, Learning with Continuous Classes (World Scientific, 1992), pp. 343–348.
28.Y. Freund and R. E. Schapire, “Experiments with a new boosting algorithm,” in Proceedings of the 13th International Conference on Machine Learning 96, 148-156 (1996).
32.J. Rodriguez, L. Kuncheva, and C. Alonso, “Rotation forest: A new classifier ensemble method,” IEEE Trans. Pattern Anal. Mach. Intell. 28, 1619–1630 (2006).
34.C. C. Fischer, K. J. Tibbetts, D. Morgan, and G. Ceder, “Predicting crystal structure by merging data mining with quantum mechanics,” Nat. Mater. 5, 641–646 (2006).
35.G. Hautier, C. C. Fischer, A. Jain, T. Mueller, and G. Ceder, “Finding natures missing ternary oxide compounds using machine learning and density functional theory,” Chem. Mater. 22, 3762–3767 (2010).
37.P. Deshpande, B. P. Gautham, A. Cecen, S. Kalidindi, A. Agrawal, and A. Choudhary, “Application of statistical and machine learning techniques for correlating properties to composition and manufacturing processes of steels,” in 2nd World Congress on Integrated Computational Materials Engineering (John Wiley & Sons, Inc., 2013), pp. 155–160.
38.A. G. Kusne, T. Gao, A. Mehta, L. Ke, M. C. Nguyen, K.-M. Ho, V. Antropov, C.-Z. Wang, M. J. Kramer, C. Long et al., “On-the-fly machine-learning for high-throughput experiments: Search for rare-earth-free permanent magnets,” Sci. Rep. 4, 6367 (2014).
39.R. Liu, Y. C. Yabansu, A. Agrawal, S. R. Kalidindi, and A. N. Choudhary, “Machine learning approaches for elastic localization linkages in high-contrast composite materials,” Integr. Mater. Manuf. Innovation 4, 1–17 (2015).
40.P. V. Balachandran, J. Theiler, J. M. Rondinelli, and T. Lookman, “Materials prediction via classification learning,” Sci. Rep. 5, 13285 (2015).
41.F. Faber, A. Lindmaa, O. A. von Lilienfeld, and R. Armiento, “Crystal structure representations for machine learning models of formation energies,” Int. J. Quantum Chem. 115, 1094–1101 (2015).
44.A. Agrawal, M. Patwary, W. Hendrix, W.-k. Liao, and A. Choudhary, “Big Data and High Performance Computing,” in Cloud Computing and Big Data, edited byL. Grandinetti, Advances in Parallel Computing Vol. 23 (IOS Press, 2013), pp. 192–211.
45.M. Patwary, D. Palsetia, A. Agrawal, W.-k. Liao, F. Manne, and A. Choudhary, “Scalable parallel optics data clustering using graph algorithmic techniques,” in Proceedings of 25th International Conference on High Performance Computing, Networking, Storage and Analysis (Supercomputing, SC’13) (ACM, 2013), pp. 1–12.
46.Z. Chen, S. W. Son, W. Hendrix, A. Agrawal, W.-k. Liao, and A. Choudhary, “Numarck: Machine learning algorithm for resiliency and checkpointing,” in Proceedings of 26th International Conference on High Performance Computing, Networking, Storage and Analysis (Supercomputing, SC’14) (ACM, 2014), pp. 733–744.
47.Y. Low, D. Bickson, J. Gonzalez, C. Guestrin, A. Kyrola, and J. M. Hellerstein, “Distributed graphlab: A framework for machine learning and data mining in the cloud,” Proc. VLDB Endowment 5, 716–727 (2012).
48.Y. Xie, D. Palsetia, G. Trajcevski, A. Agrawal, and A. Choudhary, “Silverback: Scalable association mining for temporal data in columnar probabilistic databases,” in Proceedings of 30th IEEE International Conference on Data Engineering (ICDE), Industrial and Applications Track (IEEE, 2014), pp. 1072–1083.
49.S. Jha, J. Qiu, A. Luckow, P. Mantha, and G. Fox, “A tale of two data-intensive paradigms: Applications, abstractions, and architectures,” in Big Data (BigData Congress), 2014 IEEE International Congress on (IEEE, 2014), pp. 645–652.
, P. Daga
, Y. Cheng
, K. Zhang
, A. Agrawal
, and A. Choudhary
, “Reducing infrequent-token perplexity via variational corpora
,” in Proceedings of the 53rd Annual Meeting of the Association of Computational Linguistics (ACL) and the 7th International Joint Conference on Natural Language Processing
), pp. 609
, available at https://aclweb.org/anthology/P/P15/P15-2101.pdf
52.Y. Zhou, D. Wilkinson, R. Schreiber, and R. Pan, “Large-scale parallel collaborative filtering for the netflix prize,” in Algorithmic Aspects in Information and Management (Springer, 2008), pp. 337–348.
53.Y. Xie, D. Honbo, A. Choudhary, K. Zhang, Y. Cheng, and A. Agrawal, “Voxsup: A social engagement framework,” in Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (Demo Paper) (ACM, 2012), pp. 1556–1559.
55.A. Agrawal, S. Misra, R. Narayanan, L. Polepeddi, and A. Choudhary, “Lung cancer survival prediction using ensemble data mining on seer data,” Sci. Program. 20, 29–42 (2012).
56.J. S. Mathias, A. Agrawal, J. Feinglass, A. J. Cooper, D. W. Baker, and A. Choudhary, “Development of a 5 year life expectancy index in older adults using predictive mining of electronic health record data,” J. Am. Med. Inf. Assoc. 20, e118–e124 (2013).
57.K. Lee, A. Agrawal, and A. Choudhary, “Real-time disease surveillance using twitter data: Demonstration on flu and cancer,” in Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’13 (ACM, New York, NY, USA, 2013), pp. 1474–1477.
58.L. Liu, J. Tang, Y. Cheng, A. Agrawal, W.-k. Liao, and A. Choudhary, “Mining diabetes complication and treatment patterns for clinical decision support,” in Proceedings of 22th ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, USA (ACM, 2013), pp. 279–288.
59.K. Lee, A. Agrawal, and A. Choudhary, “Mining social media streams to improve public health allergy surveillance,” in Proceedings of IEEE/ACM International Conference on Social Networks Analysis and Mining (ASONAM) (IEEE, 2015), pp. 815–822.
60.C. K. Reddy and C. C. Aggarwal, Healthcare Data Analytics (CRC Press, 2015), Vol. 36.
61.A. R. Ganguly, E. Kodra, A. Agrawal, A. Banerjee, S. Boriah, S. Chatterjee, S. Chatterjee, A. Choudhary, D. Das, J. Faghmous, P. Ganguli, S. Ghosh, K. Hayhoe, C. Hays, W. Hendrix, Q. Fu, J. Kawale, D. Kumar, V. Kumar, W.-k. Liao, S. Liess, R. Mawalagedara, V. Mithal, R. Oglesby, K. Salvi, P. K. Snyder, K. Steinhaeuser, D. Wang, and D. Wuebbles, “Toward enhanced understanding and projections of climate extremes using physics-guided data mining techniques,” Nonlinear Processes Geophys. 21, 777–795 (2014).
62.C. Jin, Q. Fu, H. Wang, W. Hendrix, Z. Chen, A. Agrawal, A. Banerjee, and A. Choudhary, “Running map inference on million node graphical models: A high performance computing perspective,” in Proceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) (IEEE, 2015), pp. 565–575.
63.V. Lakshmanan, E. Gilleland, A. McGovern, and M. Tingley, Machine Learning and Data Mining Approaches to Climate Science (Springer, 2015).
64.S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman, “Gapped BLAST and PSI-BLAST: A new generation of protein database search programs,” Nucl. Acids Res. 25, 3389–3402 (1997).
66.A. Agrawal and X. Huang, “Pairwise statistical significance of local sequence alignment using sequence-specific and position-specific substitution matrices,” IEEE/ACM Trans. Comput. Biol. Bioinf. 8, 194–205 (2011).
67.S. Misra, A. Agrawal, W.-k. Liao, and A. Choudhary, “Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing,” Bioinformatics 27, 189–195 (2011).
69.Y. Xie, Z. Chen, K. Zhang, Y. Cheng, D. K. Honbo, A. Agrawal, and A. Choudhary, “Muses: A multilingual sentiment elicitation system for social media data,” IEEE Intell. Syst. 29, 34–42 (2013).
70.Y. Cheng, A. Agrawal, H. Liu, and A. Choudhary, “Social role identification via dual uncertainty minimization regularization,” in Proceedings of International Conference on Data Mining (ICDM) (IEEE, 2014), pp. 767–772.
71.R. Zafarani, M. A. Abbasi, and H. Liu, Social Media Mining: An Introduction (Cambridge University Press, 2014).
73.G. E. Dieter, Mechanical Metallurgy, 3rd ed. (Mc Graw-Hill Book Company, 1986).
74.R. Liu, A. Agrawal, Z. Chen, W. keng Liao, and A. Choudhary, “Pruned search: A machine learning based meta-heuristic approach for constrained continuous optimization,” in Proceedings of 8th IEEE International Conference on Contemporary Computing (IC3) (IEEE, 2015), pp. 13–18.
Article metrics loading...
Our ability to collect “big data” has greatly surpassed our capability to analyze it, underscoring the emergence of the fourth paradigm of science, which is data-driven discovery. The need for data informatics is also emphasized by the Materials Genome Initiative (MGI), further boosting the emerging field of materialsinformatics. In this article, we look at how data-driven techniques are playing a big role in deciphering processing-structure-property-performance relationships in materials, with illustrative examples of both forward models(property prediction) and inverse models(materials discovery). Such analytics can significantly reduce time-to-insight and accelerate cost-effective materials discovery, which is the goal of MGI.
Full text loading...
Most read this month