2017-08-16 33 views
0

ATG - Endeca Baseline更新失敗,ART應用程序中出現以下 錯誤。但部分索引是成功的。Endeca - 基線更新失敗 - 從記錄存儲中讀取錯誤

附加CAS日誌也用於corressponding錯誤。

Aug 15, 2017 12:23:37 PM com.endeca.soleng.eac.toolkit.script.Script runBeanShellScript 
SEVERE: Crawl 'ART-last-mile-crawl' failed with error: Problem running full acquisition on data source for ART-last-mile-crawl: Error reading from Record Store ART-data: malformed input around byte 10. 
Occurred while executing line 11 of valid BeanShell script: 
[[ 

8|  Dgidx.cleanDirs(); 
9| 
10|  // run crawl and archive any changes in dvalId mappings 
11|  CAS.runBaselineCasCrawl("ART-last-mile-crawl"); 
12|  CAS.archiveDvalIdMappingsForCrawlIfChanged("ART-last-mile-crawl"); 
13| 
14|  // archive logs and run the indexer 

]] 

Aug 15, 2017 12:23:37 PM com.endeca.soleng.eac.toolkit.Controller execute 
SEVERE: Caught an exception while invoking method 'run' on object 'BaselineUpdate'. Releasing locks. 
java.lang.reflect.InvocationTargetException 
     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 

在CAS的錯誤日誌:

2017-08-15 12:23:36,485 ERROR [ART-data] [cas-ART-last-mile-crawl-worker-1] com.endeca.itl.recordstore.impl.RecordStoreImpl: Error executing method RecordStoreImpl.readRecords() 
com.endeca.itl.recordstore.RecordStoreException: malformed input around byte 10 
     at com.endeca.itl.recordstore.impl.ReadCursor.read(ReadCursor.java:81) 
     at com.endeca.itl.recordstore.impl.RecordStoreImpl.readRecords(RecordStoreImpl.java:480) 
     at sun.reflect.GeneratedMethodAccessor106.invoke(Unknown Source) 
     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 
     at java.lang.reflect.Method.invoke(Method.java:497) 
     at com.endeca.itl.service.ServicePublisher$1.invoke(ServicePublisher.java:121) 
     at com.sun.proxy.$Proxy57.readRecords(Unknown Source) 
     at com.endeca.itl.recordstore.RecordStoreReader.fetchNextChunk(RecordStoreReader.java:267) 
     at com.endeca.itl.recordstore.RecordStoreReader.hasNext(RecordStoreReader.java:244) 
     at com.endeca.itl.extension.source.merger.RecordStoreMergerDataSourceRuntime$RecordStoreReadSession.runFull(RecordStoreMergerDataSourceRuntime.java:252) 
     at com.endeca.itl.extension.source.merger.RecordStoreMergerDataSourceRuntime.runFullAcquisition(RecordStoreMergerDataSourceRuntime.java:148) 
     at com.endeca.itl.util.CasExtensionRegistry$ContextClassLoaderDataSourceExtensionRuntime$2.doWork(CasExtensionRegistry.java:220) 
     at com.endeca.itl.util.CasExtensionRegistry$ContextClassLoaderDataSourceExtensionRuntime$2.doWork(CasExtensionRegistry.java:218) 
     at com.endeca.itl.plugin.ThreadContextRunner.run(ThreadContextRunner.java:136) 
     at com.endeca.itl.plugin.ThreadContextRunner.run(ThreadContextRunner.java:89) 
     at com.endeca.itl.util.CasExtensionRegistry$ContextClassLoaderDataSourceExtensionRuntime.runFullAcquisition(CasExtensionRegistry.java:218) 
     at com.endeca.itl.executor.extension.ExtensionDataSourceProcessor.processRecord(ExtensionDataSourceProcessor.java:104) 
     at com.endeca.itl.executor.extension.IncrementalDataSourceProcessor.processRecord(IncrementalDataSourceProcessor.java:106) 
     at com.endeca.itl.executor.TaskManager$2.work(TaskManager.java:166) 
     at com.endeca.itl.executor.WorkExecutor$WorkRunnable.run(WorkExecutor.java:194) 
     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) 
     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
     at java.lang.Thread.run(Thread.java:745) 
     at com.endeca.itl.util.LoggingContextAwareThread.run(LoggingContextAwareThread.java:71) 
Caused by: java.io.UTFDataFormatException: malformed input around byte 10 
     at java.io.DataInputStream.readUTF(DataInputStream.java:656) 
     at java.io.DataInputStream.readUTF(DataInputStream.java:564) 
     at com.endeca.itl.recordstore.impl.storage.RecordStorageEntry.load(RecordStorageEntry.java:114) 

如何分流和調試進一步將是有益的任何輸入

+0

看起來像損壞的數據,不能使用UTF-8轉換。是在這個環境中工作過的基線還是最近開始的? –

+0

感謝@AjayAgrawal的迴應。直到有一天它工作良好。 ./recordstore-cmd.sh read-baseline -a ART-data -c即使記錄計數也會產生相同的錯誤。 「退出錯誤,失敗,字節10周圍的輸入格式錯誤」 –

+0

我會建議初始化服務並查看它是否有效。似乎像記錄存儲損壞.. –

回答

2

上述錯誤是完成以下後解決腳步。

  1. 第1步 - 在添加以下更改後導出和導入CAS配置。 <ignoreInvalidRecords>true</ignoreInvalidRecords>

    recordstore-cmd.sh get-configuration -a ART-data -f dataConfig.xml 
    recordstore-cmd.sh set-configuration -a ART-data -f dataConfig.xml 
    
  2. 第二步 - 相同的變化,在本記錄存儲配置文件進行下<CAS_WS>/workspace/state/ART-data

    <?xml version="1.0" encoding="UTF-8" standalone="yes"?> 
    <recordStoreConfiguration xmlns="http://recordstore.itl.endeca.com/"> 
        <changePropertyNames/> 
        <idPropertyName>record.id</idPropertyName> 
        <ignoreInvalidRecords>true</ignoreInvalidRecords><!-- newly added --> 
        <jdbmSettings/> 
    </recordStoreConfiguration> 
    
  3. 步驟3 - 將cas_output<Endeca_apps>/ART/data/cas_output用備份替換(2天過去)那是可用的。

經過上述步驟,索引是直接從後端啓動(通過調用腳本)。一旦這個索引成功,索引就從發電機中調用,並且成功。