IndexError: list index out of range error in NYC Data #1009

@mansoorsyed11


Hey team, I'm following the steps your documentation illustrates, but I'm getting the error below. Could you please help me understand why I'm getting it?
Thanks.

Feathr version: 0.10.4-rc1
PROJECT_NAME ="nyc_taxi-"
SPARK_CLUSTER: databricks

# Features that we want to request. Can use a subset of features.

query = FeatureQuery(
    feature_list=feature_names,
    key=agg_key,
)
settings = ObservationSettings(
    observation_path=data_source_path,
    event_timestamp_column=TIMESTAMP_COL,
    timestamp_format=TIMESTAMP_FORMAT,
)
client.get_offline_features(
    observation_settings=settings,
    feature_query=query,
    # For more details, see https://feathr-ai.github.io/feathr/how-to-guides/feathr-job-configuration.html
    execution_configurations=SparkExecutionConfiguration({
        "spark.feathr.outputFormat": DATA_FORMAT,
    }),
    output_path=data_source_path.rpartition("/")[0] + f"/features.{DATA_FORMAT}",
)

client.wait_job_to_finish(timeout_sec=5000)
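
For reference, the `output_path` expression in the call above swaps the last segment of `data_source_path` for `features.<format>`. Below is a minimal sketch of that string handling in isolation. The helper name `derive_output_path`, the example path, and the `"parquet"` value are assumptions made for illustration; they are not taken from the notebook.

```python
def derive_output_path(data_source_path: str, data_format: str) -> str:
    """Replace the last path segment with 'features.<data_format>'."""
    # rpartition splits on the LAST "/" and returns (head, sep, tail);
    # element [0] is everything before that final slash.
    parent = data_source_path.rpartition("/")[0]
    return f"{parent}/features.{data_format}"


# Hypothetical values, used only to show the result of the expression.
example_path = "dbfs:/example_project/green_tripdata.csv"
print(derive_output_path(example_path, "parquet"))
# prints: dbfs:/example_project/features.parquet
```

Note that if the path contains no "/", `rpartition` returns an empty head, so the result would begin with "/".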

ERROR:

2023-01-23 10:29:10.173 | INFO | feathr.spark_provider._databricks_submission:upload_or_get_cloud_path:122 - Uploading folder /tmp/tmpast21ntk/feature_conf/
2023-01-23 10:29:10.689 | WARNING | feathr.utils._env_config_reader:get_from_env_or_akv:91 - Config ADLS_ACCOUNT is not found in the environment variable or the remote key value store.
2023-01-23 10:29:10.690 | WARNING | feathr.utils._env_config_reader:get_from_env_or_akv:91 - Config ADLS_KEY is not found in the environment variable or the remote key value store.
2023-01-23 10:29:10.691 | WARNING | feathr.utils._env_config_reader:get_from_env_or_akv:91 - Config BLOB_ACCOUNT is not found in the environment variable or the remote key value store.
2023-01-23 10:29:10.692 | WARNING | feathr.utils._env_config_reader:get_from_env_or_akv:91 - Config BLOB_KEY is not found in the environment variable or the remote key value store.
2023-01-23 10:29:10.692 | INFO | feathr.spark_provider._databricks_submission:submit_feathr_job:194 - Using an existing general purpose cluster to run the feathr job...
2023-01-23 10:29:10.693 | WARNING | feathr.spark_provider._databricks_submission:submit_feathr_job:197 - Spark execution configuration will be ignored. To use job-specific spark configs, please use a new job cluster or set the configs via Databricks UI.
2023-01-23 10:29:10.694 | WARNING | feathr.spark_provider._databricks_submission:submit_feathr_job:201 - Job tags will be ignored. To assign job tags to the cluster, please use a new job cluster.
2023-01-23 10:29:10.695 | INFO | feathr.spark_provider._databricks_submission:submit_feathr_job:228 - Main JAR file is not set, using default package 'com.linkedin.feathr:feathr_2.12:0.10.4-rc1' from Maven
IndexError: list index out of range
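
The output above shows only the final exception line, not which statement raised it. One way to get the full stack trace is to wrap the failing call and format the exception with the standard-library `traceback` module. This is a minimal sketch: `run_and_capture` and `_fails` are hypothetical names, and `_fails` is only a stand-in that raises the same kind of error; in the notebook the wrapped call would presumably be `client.get_offline_features(...)`.

```python
import traceback


def run_and_capture(func, *args, **kwargs):
    """Call func and return (result, traceback_text).

    On success traceback_text is None; on failure result is None and
    traceback_text holds the full formatted stack trace.
    """
    try:
        return func(*args, **kwargs), None
    except Exception:
        # format_exc() returns the traceback of the exception being handled.
        return None, traceback.format_exc()


def _fails():
    # Stand-in for the real call: indexing an empty list raises IndexError.
    empty = []
    return empty[0]


result, tb_text = run_and_capture(_fails)
print(tb_text)  # includes the file, line, and function where the error occurred
```

Posting the printed traceback alongside the log lines would show whether the error comes from the client library or from the notebook code.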
