Welcome to fabric8-analytics-rudra’s documentation!
Data Store and Retrieval from various Storage.
Basic interface to the Amazon S3.
class rudra.data_store.aws.AmazonEmr(*args, **kwargs)
Bases: rudra.data_store.aws.AmazonS3
Basic interface to the Amazon EMR.
class rudra.data_store.aws.AmazonS3(aws_access_key_id=None, aws_secret_access_key=None, bucket_name=None, region_name=None, use_ssl=False, encryption=None, versioned=None, local_dev=False, endpoint_url=None)
Bases: rudra.data_store.abstract_data_store.AbstractDataStore
Basic interface to the Amazon S3.
load_matlab_multi_matrix(s3_path)
Load a ‘.mat’ file and return a dict representation.
s3_path: The path of the object in the S3 bucket.
Returns: A dict mapping each key of the multi-matrix to a numpy matrix.
object_exists(object_key)
Check whether an object with the given key exists in the bucket. Issues only a HEAD request.
exception rudra.data_store.aws.NotFoundAccessKeySecret
Bases: Exception
Exception for invalid AWS secret/key.
Local data_store interface.
class rudra.data_store.local_data_store.LocalDataStore(src_dir)
Bases: rudra.data_store.abstract_data_store.AbstractDataStore
Wrapper around the local filesystem, with an API similar to s3DataStore.
Google Bigquery data collection implementation.
Base implementation of the Bigquery builder.
class rudra.data_store.bigquery.base.BigqueryBuilder(query_job_config=None)
Bases: object
BigqueryBuilder class implementation.
class rudra.data_store.bigquery.base.DataProcessing(s3_client=None)
Bases: object
Process the Bigquery data.
Maven bigquery implementation.
class rudra.data_store.bigquery.maven_bigquery.MavenBQDataProcessing(big_query_instance=None, s3_client=None, file_name='collated.json')
Bases: rudra.data_store.bigquery.base.DataProcessing
Data processing implementation for Maven Bigquery.
class rudra.data_store.bigquery.maven_bigquery.MavenBigQuery(*args, **kwargs)
Bases: rudra.data_store.bigquery.base.BigqueryBuilder
MavenBigQuery implementation.
Npm bigquery implementation.
class rudra.data_store.bigquery.npm_bigquery.NpmBQDataProcessing(big_query_instance=None, s3_client=None, file_name='collated.json')
Bases: rudra.data_store.bigquery.base.DataProcessing
Data processing implementation for npm Bigquery.
class rudra.data_store.bigquery.npm_bigquery.NpmBigQuery(*args, **kwargs)
Bases: rudra.data_store.bigquery.base.BigqueryBuilder
NpmBigQuery implementation.
Deployment scripts.
EMR Deployments.
class rudra.deployments.emr_scripts.MavenEMR
Bases: rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
Maven EMR script implementation.
ecosystem = 'maven'
class rudra.deployments.emr_scripts.NpmEMR
Bases: rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
NPM EMR script implementation.
ecosystem = 'npm'
class rudra.deployments.emr_scripts.PyPiEMR
Bases: rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
PyPi EMR script implementation.
ecosystem = 'pypi'
Configurations for EMR instance.
class rudra.deployments.emr_scripts.emr_config.EMRConfig(name, log_uri, ecosystem, s3_bootstrap_uri, training_repo_url, training_file_name='training/train.py', release_label='emr-5.10.0', instance_count=1, instance_type='m3.xlarge', applications=[{'Name': 'MXNet'}], visible_to_all_users=True, job_flow_role='EMR_EC2_DefaultRole', service_role='EMR_DefaultRole', properties={}, hyper_params='{}')
Bases: object
Config class for EMR.
home_dir = '/home/hadoop'
EMR script builder implementation.
class rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
Bases: rudra.deployments.emr_scripts.abstract_emr.AbstractEMR
EMR script implementation.
EMR script implementation for the Maven service.
class rudra.deployments.emr_scripts.maven_emr.MavenEMR
Bases: rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
Maven EMR script implementation.
ecosystem = 'maven'
EMR script implementation for the NPM service.
class rudra.deployments.emr_scripts.npm_emr.NpmEMR
Bases: rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
NPM EMR script implementation.
ecosystem = 'npm'
EMR script implementation for the PYPI service.
class rudra.deployments.emr_scripts.pypi_emr.PyPiEMR
Bases: rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder
PyPi EMR script implementation.
ecosystem = 'pypi'
Package of various utility functions.
Validation Utility module.
rudra.utils.validation.check_field_exists(input_data, fields)
Check that the given fields exist in the input data.
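As a rough illustration of this kind of check, the sketch below reports which required fields are absent from a dict. The helper name `missing_fields` and its return convention (a list of the missing names) are assumptions made for the example, not rudra's documented behaviour.

```python
def missing_fields(input_data, fields):
    """Return the names in `fields` that are not keys of `input_data`.

    Hypothetical helper for illustration; not rudra's implementation.
    """
    return [field for field in fields if field not in input_data]


# Example payload with one required field left out.
payload = {"ecosystem": "npm", "bucket_name": "models"}
print(missing_fields(payload, ["ecosystem", "bucket_name", "github_repo"]))
```

Returning the missing names, rather than a bare boolean, lets a caller build a precise error message.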
rudra.utils.validation.check_url_alive(url, accept_codes=[401])
Check whether the GitHub repo at the given URL exists.
Utility helper functions.
class rudra.utils.helper.CacheDict(max_len=1024)
Bases: object
CacheDict implementation with max size limit.
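One way a size-limited dict can work is sketched below. The class name `BoundedCache` and the oldest-first eviction policy are assumptions for illustration; the documentation above states only that CacheDict has a maximum size.

```python
from collections import OrderedDict


class BoundedCache:
    """Dict-like cache holding at most `max_len` entries (hypothetical sketch)."""

    def __init__(self, max_len=1024):
        if max_len < 1:
            raise ValueError("max_len must be at least 1")
        self.max_len = max_len
        self._items = OrderedDict()

    def __setitem__(self, key, value):
        # Re-inserting an existing key moves it to the newest position.
        if key in self._items:
            self._items.move_to_end(key)
        self._items[key] = value
        # Evict the oldest entries once the limit is exceeded.
        while len(self._items) > self.max_len:
            self._items.popitem(last=False)

    def __getitem__(self, key):
        return self._items[key]

    def __contains__(self, key):
        return key in self._items

    def __len__(self):
        return len(self._items)


cache = BoundedCache(max_len=2)
cache["a"] = 1
cache["b"] = 2
cache["c"] = 3  # "a" is evicted because it is the oldest entry
print("a" in cache, len(cache))
```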
rudra.utils.helper.get_training_file_url(user, repo, branch='master', training_file_path='training/train.py')
Get the URL of the training file in the GitHub repo.
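A minimal sketch of how such a URL could be assembled from the four parameters, assuming the file is fetched from GitHub's raw-content host. The function name `build_training_file_url` and the choice of host are assumptions for the example; the documentation above does not say which URL form rudra returns.

```python
RAW_GITHUB_HOST = "https://raw.githubusercontent.com"  # assumed host


def build_training_file_url(user, repo, branch="master",
                            training_file_path="training/train.py"):
    """Build a raw-content URL for a file in a GitHub repo (hypothetical sketch)."""
    if not user.strip() or not repo.strip():
        raise ValueError("user and repo must be non-empty")
    # Raw-content URLs follow the layout /<user>/<repo>/<branch>/<path>.
    path = "/".join((user, repo, branch, training_file_path))
    return f"{RAW_GITHUB_HOST}/{path}"


print(build_training_file_url("some-user", "some-repo"))
```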
rudra.utils.helper.load_hyper_params()
Load the hyperparameters from the command-line arguments.
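EMRConfig above takes `hyper_params` as a JSON string (default `'{}'`), so one plausible reading is that the training script receives that string as a command-line argument. The sketch below parses it under that assumption. The function name `parse_hyper_params` and the use of the first argument are hypothetical, not rudra's confirmed behaviour.

```python
import json


def parse_hyper_params(argv):
    """Parse a JSON object of hyperparameters from the first argument (sketch)."""
    if not argv:
        # No argument supplied: fall back to an empty set of hyperparameters.
        return {}
    params = json.loads(argv[0])
    if not isinstance(params, dict):
        raise ValueError("hyperparameters must be a JSON object")
    return params


# Stands in for sys.argv[1:] when the script is started with one JSON argument.
print(parse_hyper_params(['{"lr": 0.01, "epochs": 5}']))
```

Passing the argument list in explicitly, instead of reading `sys.argv` inside the function, keeps the parser easy to test.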
Mercator: implementation of a dependency finder.