Data Scientists: data scientists are needed to cover the full spectrum of a data-science project lifecycle and from areas covering all verticals within data science to include structured and unstructured data, high-speed (high “velocity”) analytics, in-database applications, etc. Essentially, verticals that will need to be represented include Statistics (descriptive and inferential), Data Analysis, Database Engineering, Computer Programming, Linguistics / Natural Language Processing, Data Mining / Text Mining, Machine Learning and Information Retrieval. It is likely that one Data Scientist will be able to address more than one of these verticals. Data Scientists need not have a legal background but this is desirable.
Legal Professionals: attorneys, paralegals, litigation support, investigators and similar professionals with strong technical background who can assist with addressing legal principles in the application of data-science process, ensure cross-functional integrity of process and assist with best practices for use of data science in the legal profession. This is the first time this effort is known to have occurred so requirements for this area remain undefined in many areas.
To compound this, often, professionals from one data-science vertical are not familiar with what others do. This is one more reason a common model is needed – to keep all professionals across the data-science verticals focused in the same direction.
The Big Data Committee is currently working on a book series. The books will include educational material, best practices, practical advice, where to find useful resources, etc. The outlines are currently being refined but will include topics organized by practice area where use cases will provide hypothetical scenarios and where data-science solutions will be utilized. It is expected that most major legal-profession domain areas will be represented and major areas of data science will be discussed.