This repository has been archived by the owner on Jun 9, 2023. It is now read-only.

dataXplore

Analytics NLP Support

Data science has become one of the most popular real-world applications of ML. This skill is aimed specifically at making the CLI easier for data scientists to adopt and navigate.

Implementation

The current version of the skill provides two functions: summarize and plot. "Summarize" uses the describe function of the popular pandas library to generate a human-readable summary of a specified CSV file, so that data scientists can quickly examine any data file right from the command line. "Plot" builds on the plot function provided by Matplotlib, together with the Pillow library, to generate a plot of a given CSV file. These functions illustrate basic use cases of CLAI as a CLI assistant for data science.
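The two functions described above can be sketched in a few lines of Python. This is a minimal illustration rather than the skill's actual code: the function names `summarize_csv` and `plot_csv`, the default output file name, and the choice to save the figure to a PNG file are all assumptions made for the example, and the Pillow step is left out.

```python
import pandas as pd
import matplotlib

matplotlib.use("Agg")  # headless backend, so no display is needed
import matplotlib.pyplot as plt


def summarize_csv(path: str) -> str:
    """Return the pandas describe() summary of a CSV file as text.

    Hypothetical helper; the name is not taken from the skill.
    """
    frame = pd.read_csv(path)
    return frame.describe().to_string()


def plot_csv(path: str, out_path: str = "plot.png") -> str:
    """Plot the numeric columns of a CSV file and save the figure.

    Hypothetical helper; saving to a PNG file is an assumption of
    this sketch, not necessarily how the skill displays its plot.
    """
    frame = pd.read_csv(path)
    axes = frame.plot()  # one line per numeric column
    axes.figure.savefig(out_path)
    plt.close(axes.figure)  # release the figure's memory
    return out_path
```

Calling `print(summarize_csv("air_quality.csv"))` would print the count, mean, standard deviation, and quartiles of each numeric column, and `plot_csv("air_quality.csv")` would write the plot to `plot.png` in the working directory.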

Example Usage

>> clai "dataxplore" summarize air_quality.csv to view the summary of the given data file.

>> clai "dataxplore" plot air_quality.csv to view a plot of the given data file.

figure1

figure2

Both datasets are courtesy of pandas.
