Avoiding failure when using frozen indices #1842

masseyke · 2022-01-04T22:04:10Z

This commit changes es-hadoop to treat frozen indices as empty indices rather than throwing exceptions. Before
this change, querying a frozen index with es-hadoop or Spark caused the job to fail with an exception like this:

org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 (TID 3) (localhost executor 2): org.elasticsearch.hadoop.EsHadoopIllegalArgumentException: invalid response
	at org.elasticsearch.hadoop.util.Assert.isTrue(Assert.java:60)
	at org.elasticsearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:272)
	at org.elasticsearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:263)
	at org.elasticsearch.hadoop.rest.RestRepository.scroll(RestRepository.java:313)
	at org.elasticsearch.hadoop.rest.ScrollQuery.hasNext(ScrollQuery.java:94)
	at org.elasticsearch.spark.rdd.AbstractEsRDDIterator.hasNext(AbstractEsRDDIterator.scala:66)
	at org.apache.spark.util.Utils$.getIteratorSize(Utils.scala:1889)
	at org.apache.spark.rdd.RDD.$anonfun$count$1(RDD.scala:1253)
	at org.apache.spark.rdd.RDD.$anonfun$count$1$adapted(RDD.scala:1253)
	at org.apache.spark.SparkContext.$anonfun$runJob$5(SparkContext.scala:2254)
	at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:90)
	at org.apache.spark.scheduler.Task.run(Task.scala:131)
	at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:506)
	at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1462)
	at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:509)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
	at java.lang.Thread.run(Thread.java:748)

We don't want to change the default behavior to read frozen indices, since reading them will be very slow. We might add a config option to have es-hadoop pass ignore_throttled=false so that frozen indices can be queried, but since frozen indices are deprecated, that might not be worth the effort.
Relates #1734
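
To illustrate the behavior change described above, here is a minimal Python sketch of the general pattern: a reader that returns an empty result instead of raising when a response carries no documents. This is not the es-hadoop code (the actual change lives in the Java `ScrollReader`, which is not shown in this conversation). The function name `read_scroll_hits` and the shape of the "skipped" response are assumptions made purely for illustration.

```python
from typing import Any, Dict, List


def read_scroll_hits(response: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Return the documents in a search response, or [] if there are none.

    Hypothetical helper, not part of es-hadoop. It ASSUMES that a search
    which skips a frozen index produces a response with no "hits" section.
    """
    hits_section = response.get("hits")
    if hits_section is None:
        # The pre-change behavior corresponds to raising here
        # ("invalid response"); the new behavior treats the index as empty.
        return []
    return list(hits_section.get("hits", []))


# A response with one document, and an assumed response for a skipped index.
normal = {"_scroll_id": "abc", "hits": {"hits": [{"_id": "1", "_source": {"f": 1}}]}}
skipped = {"_shards": {"total": 0, "successful": 0, "skipped": 0, "failed": 0}}

print(len(read_scroll_hits(normal)))
print(len(read_scroll_hits(skipped)))
```

Returning an empty list keeps callers such as a count or an iterator working unchanged, at the cost of silently hiding the fact that the index was not actually read.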

jbaiera

LGTM

Avoiding failure when using frozen indices

c326a2f

masseyke added bug v8.0.0-rc2 v8.1.0 v7.17.0 labels Jan 4, 2022

masseyke requested a review from jbaiera January 4, 2022 22:04

Adding test file that was missed

2776042

jbaiera approved these changes Jan 24, 2022

masseyke merged commit 755aa1c into elastic:master Jan 26, 2022

masseyke deleted the fix/exception-on-frozen-indices branch January 26, 2022 22:47

masseyke added the backport pending label Jan 26, 2022

masseyke added a commit to masseyke/elasticsearch-hadoop that referenced this pull request Jan 26, 2022

Avoiding failure when using frozen indices (elastic#1842)

1b0cc72

masseyke added a commit to masseyke/elasticsearch-hadoop that referenced this pull request Jan 26, 2022

Avoiding failure when using frozen indices (elastic#1842)

0febba0

masseyke added a commit that referenced this pull request Jan 27, 2022

Avoiding failure when using frozen indices (#1842) (#1885)

4ffaaf6

masseyke added a commit that referenced this pull request Jan 27, 2022

Avoiding failure when using frozen indices (#1842) (#1886)

0a67122

masseyke removed the backport pending label Jan 27, 2022

This was referenced Jan 28, 2022

Fixing ScrollReaderTest compilation #1887

Merged

Fixing ScrollReaderTest compilation #1888

Merged

masseyke added the :Core label Jan 28, 2022
