The Data Mining and Statistics (DMS) Group Manager at the Texas Advanced Computing Center (TACC) will lead the group responsible for the design, implementation, application, and support of the technologies that enable the wide array of data-driven research and analytics performed at TACC.
This position will lead the DMS staff as TACC expands its role as a national facility supporting data computing, and will recruit new researchers and developers to expand its efforts and impact. The manager will participate in grant proposal design, development, and execution to continue to expand and evolve the data computational capabilities at TACC. The DMS manager will also be responsible for expanding the capabilities and data computational offerings at TACC by way of collaborations with technology developers and vendors. The manager will coordinate work between the DMS team and scientists from many disciplines and institutions as they integrate data computing techniques in their work. This position will be in charge of user support and the day-to-day operational needs of the hardware and software systems supported by the group, and will participate in the development of both online and in-person training to mentor researchers in the data computational capabilities supported at TACC. 
The specific activities of the TACC DMS Manager will include: Managing the DMS team and recruiting new staff to support and expand the data computational research productivity at TACC; Designing new systems and working with hardware and software vendors and developers to grow and tailor the data computation capabilities at TACC to the needs of current and future researchers; Working with researchers and engineers to match technologies and TACC staff expertise to their data computational research needs; Writing proposals and conducting research and development to develop new technologies that enable data-intensive computing, and sharing and publishing the results; Communicating with and educating TACC staff about the capabilities and support needs of technologies and systems used for data computations; Establishing, maintaining, and documenting internal and external data policies, guidelines, processes, and systems; Providing information and reports required by TACC management, staff, and outside agencies; Attending workshops, conferences, and other meetings to publicize TACC's data computing capabilities and seek out collaborative opportunities; Communicating with non-researchers to educate the public about the impact that data-driven research has on the world; Ensuring that budgets are met and resources are used effectively.
Other related functions as assigned.
PhD in Computer Science, Statistics, or other related Data Science research field; At least 6 years' experience working in a data computation field; At least 3 years' postdoctoral experience in developing and/or managing data mining and analytics routines or data workflows; Experience working with domain expert researchers and engineers to integrate data computing techniques into their research activities; Experience in developing and submitting grant proposals supporting computing research. Ability to lead multiple projects from concept to completion; Ability to work with domain experts and researchers to help craft solutions for their data computational needs; Ability to learn and adapt new technologies to enable new capabilities or improve on existing ones; Excellent written and verbal communication skills; Excellent project management skills; Excellent problem-solving and strategic thinking skills; Excellent interpersonal skills. Equivalent combination of relevant education and experience may be substituted as appropriate.
At least 5 years' experience in developing and managing data mining and analytics routines or data workflows; Experience leading a team of domain and technology experts; Experience in designing and deploying data computational systems; Experience with Hadoop MapReduce, HBase, Pig, Storm, Spark, and/or other data analytics technologies; Experience with data mining and machine learning using R, Python, MATLAB, or other common data science languages; Experience with data-driven workflows and workflow managers; Experience teaching data computing techniques to a wide range of students and researchers; Experience developing and teaching undergraduate- and graduate-level classes; Experience working with MPI, OpenMP, and other forms of parallel computing; Experience working with Xeon Phi or GPU-based computing.
May work around standard office conditions. Repetitive use of a keyboard at a workstation.