If someone uploads a corrupt .mp3, .mp4, or .mov (various other formats as well) file into Ektron as a DMS asset, and you then run a Solr crawl, the file causes the Solr crawl to hang.
This problem is caused by a Tika bug, described in http://jira.xwiki.org/browse/XWIKI-9528.
There are 2 solutions:
- When importing the .mp3 file, uncheck the Content Searchable checkbox. This tells Solr to ignore the file when beginning a crawl.
To access the Content Searchable checkbox, import the file, check it in, then click the Edit Properties toolbar button. - Import the file as a library asset. The Solr crawl ignores library files. See http://documentation.ektron.com/cms400/v9.00/Reference/Web/EktronReferenceWeb.html#Content/Library.htm#addlibraryitem.
Please sign in to leave a comment.