Qubole Glossary

Showing 1 to 10 of 65
Definition: Also known as AWS.  The collection of cloud based technical products and services offered by Amazon including data storage, networking, processing and security. See also: https://aws.amazon.com
Ready
Definition: A standard for SQL languages outlining the syntax and semantics of the Structured Query Language See also: https://en.wikipedia.org/wiki/SQ
Definition: A component of Tez responsible for accepting job-submissions, tracking status and monitoring for progress on the Worker Nodes in the Cluster. See also: http://tez.apache.org
Ready
Definition: A feature offered by cloud services to enable the programmatic addition or removal of nodes from a cluster. See also: http://docs.qubole.com/en/latest/admin-guide/auto-scaling.htm
Ready
Definition: Serialized binary file format designed for more sophisticated data extraction and management, uses JSON to define objects. See also: https://avro.apache.org
Definition: Also known as BLOBS.  A data type designed for storing unstructured data in either text or binary format. BLOBS are used to store images and other multimedia files. See also: https://en.wikipedia.org/wiki/Binary_large_objec
Ready
Definition: The script or set of scripts that run immediately after initialization of a node. See also: http://docs.qubole.com/en/latest/user-guide/clusters/node-bootstrap.htm
Definition: Bootstrap is an open source toolkit for developing with HTML, CSS, and JS. Quickly prototype your ideas or build your entire app with our Sass variables and mixins, responsive grid system, extensive prebuilt components, and powerful plugins built on jQuery. ...
Ready
Definition: The logical collection of multiple computation focused machines arranged in a Master - Worker Node configuration. See also: http://docs.qubole.com/en/latest/user-guide/clusters/cluster-basics.htm
Ready
Definition: DataFames extend resilient distributed dataset (RDD) functionality while imposing a schema structure on the data, so table-like operations and SQL can be run against the manipulated data. See also: https://spark.apache.org/docs/latest/sql-programming-guide.html#datasets-and-dataframes