Welcome to fabric8-analytics-rudra’s documentation!¶
Indices and tables¶
Data Store and Retrieval from various Storage.
Basic interface to the Amazon S3.
-
class
rudra.data_store.aws.AmazonEmr(*args, **kwargs)[source]¶ Bases:
rudra.data_store.aws.AmazonS3Basic interface to the Amazon EMR.
-
class
rudra.data_store.aws.AmazonS3(aws_access_key_id=None, aws_secret_access_key=None, bucket_name=None, region_name=None, use_ssl=False, encryption=None, versioned=None, local_dev=False, endpoint_url=None)[source]¶ Bases:
rudra.data_store.abstract_data_store.AbstractDataStoreBasic interface to the Amazon S3.
-
load_matlab_multi_matrix(s3_path)[source]¶ Load a ‘.mat’file & return a dict representation.
- S3_path
The path of the object in the S3 bucket.
- Returns
A dict containing numpy matrices against the keys of the multi-matrix.
-
object_exists(object_key)[source]¶ Check if the there is an object with the given key in bucket, does only HEAD request.
-
-
exception
rudra.data_store.aws.NotFoundAccessKeySecret[source]¶ Bases:
ExceptionException for invalid AWS secret/key.
Local data_store interface.
-
class
rudra.data_store.local_data_store.LocalDataStore(src_dir)[source]¶ Bases:
rudra.data_store.abstract_data_store.AbstractDataStoreWrapper on local filesystem, API similar to s3DataStore.
Google Bigquery data collection implementation.
Implementation Bigquery builder base.
-
class
rudra.data_store.bigquery.base.BigqueryBuilder(query_job_config=None)[source]¶ Bases:
objectBigqueryBuilder class Implementation.
-
class
rudra.data_store.bigquery.base.DataProcessing(s3_client=None)[source]¶ Bases:
objectProcess the Bigquery Data.
Maven bigquery implementation.
-
class
rudra.data_store.bigquery.maven_bigquery.MavenBQDataProcessing(big_query_instance=None, s3_client=None, file_name='collated.json')[source]¶ Bases:
rudra.data_store.bigquery.base.DataProcessingImplementation data processing for maven bigquery.
-
class
rudra.data_store.bigquery.maven_bigquery.MavenBigQuery(*args, **kwargs)[source]¶ Bases:
rudra.data_store.bigquery.base.BigqueryBuilderMavenBigQuery Implementation.
Npm bigquery implementation.
-
class
rudra.data_store.bigquery.npm_bigquery.NpmBQDataProcessing(big_query_instance=None, s3_client=None, file_name='collated.json')[source]¶ Bases:
rudra.data_store.bigquery.base.DataProcessingImplementation data processing for npm bigquery.
-
class
rudra.data_store.bigquery.npm_bigquery.NpmBigQuery(*args, **kwargs)[source]¶ Bases:
rudra.data_store.bigquery.base.BigqueryBuilderNpmBigQuery Implementation.
Deployments scripts.
EMR Deployments.
-
class
rudra.deployments.emr_scripts.MavenEMR[source]¶ Bases:
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilderMaven Emr script implementation.
-
ecosystem= 'maven'¶
-
-
class
rudra.deployments.emr_scripts.NpmEMR[source]¶ Bases:
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilderNPM Emr script implementation.
-
ecosystem= 'npm'¶
-
-
class
rudra.deployments.emr_scripts.PyPiEMR[source]¶ Bases:
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilderPyPi Emr script implementation.
-
ecosystem= 'pypi'¶
-
Configurations for EMR instance.
-
class
rudra.deployments.emr_scripts.emr_config.EMRConfig(name, log_uri, ecosystem, s3_bootstrap_uri, training_repo_url, training_file_name='training/train.py', release_label='emr-5.10.0', instance_count=1, instance_type='m3.xlarge', applications=[{'Name': 'MXNet'}], visible_to_all_users=True, job_flow_role='EMR_EC2_DefaultRole', service_role='EMR_DefaultRole', properties={}, hyper_params='{}')[source]¶ Bases:
objectConfig class for EMR.
-
home_dir= '/home/hadoop'¶
-
EMR script builder implementation.
-
class
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilder[source]¶ Bases:
rudra.deployments.emr_scripts.abstract_emr.AbstractEMREMR Script implementation.
EMR script implementation for the Maven service.
-
class
rudra.deployments.emr_scripts.maven_emr.MavenEMR[source]¶ Bases:
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilderMaven Emr script implementation.
-
ecosystem= 'maven'¶
-
EMR script implementation for the NPM service.
-
class
rudra.deployments.emr_scripts.npm_emr.NpmEMR[source]¶ Bases:
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilderNPM Emr script implementation.
-
ecosystem= 'npm'¶
-
EMR script implementation for the PYPI service.
-
class
rudra.deployments.emr_scripts.pypi_emr.PyPiEMR[source]¶ Bases:
rudra.deployments.emr_scripts.emr_script_builder.EMRScriptBuilderPyPi Emr script implementation.
-
ecosystem= 'pypi'¶
-
Package for various utils function.
Validation Utility module.
-
rudra.utils.validation.check_field_exists(input_data, fields)[source]¶ Check field exist in the input data.
-
rudra.utils.validation.check_url_alive(url, accept_codes=[401])[source]¶ Validate github repo exist or not.
Utility helper functions.
-
class
rudra.utils.helper.CacheDict(max_len=1024)[source]¶ Bases:
objectCacheDict implementation with max size limit.
-
rudra.utils.helper.get_training_file_url(user, repo, branch='master', training_file_path='training/train.py')[source]¶ Get the training file from the github repo.
-
rudra.utils.helper.load_hyper_params()[source]¶ Load the hyper parameter from the command line args.
Mercator: implementation of dependencies finder.