Forums
Subject: Must be missing something...
Prev Next
You are not authorized to post a reply.

Author Messages
memondUser is Offline

Posts:19

Send A Message
04/02/2008 10:35 AM  
I installed the latest updates for seamus and search, inserted them on my site (Version 4.6.2). Read the documentation on both (and following the seamus doc, set seamus to load on all pages) and went through and re-ran the indexing. Search results still seem to be restricted to content set up via rss because when I browse other pages on the site I see the Seamus total items indexed number rising but I still don't see any search results being added from those newly indexed pages. What should I be looking into to correct this? I need it to search all my web pages, a child portal, and all my files on the server (mostly pdf's). As an aside, I did notice that when I uncheck the subscribe setting under the search options, it doesn't go away and using an image for the search button didn't work either. I should mention that I am the guy with the site running off an IP however, I did set up a temporary redirect: http://www.manchesternh.gov/website Thanks for your help, Mike
Jeff SmithUser is Offline

Posts:210

Send A Message
04/02/2008 10:54 AM  
When you say you "re-ran the indexing", you mean you went into Edit Seamus > Execute Commands > Reload Index, correct?

Questions:
1. Are those pages public or protected?
2. Do those pages have the text/html module on them?
3. From Host > SQL what do you get for the following query: select * from venexus_braindump where IndexURL = 'INSERT URL'
4. Are all of the pdf files in the portal root?
memondUser is Offline

Posts:19

Send A Message
04/02/2008 11:28 AM  
Yes that correct, sorry. I re-indexed from within the module menu. 1. I currently do not have windows (iis) securinty turned on although I will need to restore it at some point soon here. I also have the DNN setting the majority of my site pages viewable by all users. 2. Almost all pages use a modification of the text/html module (effority workflow module) - hope that's not the problem as we really depend on its versioning control. 3. I get the message "The query completed successfully" 4. All pdfs are a few directories deep but yes, all are in the portal root folder. Hope this info helps narrow down the issue. Thanks for your help. Mike
Jeff SmithUser is Offline

Posts:210

Send A Message
04/02/2008 11:43 AM  
"Almost all pages use a modification of the text/html module (effority workflow module) " - Yes that is an issue. We only support core DNN modules that use iPortalble interface for the main dump we do of data. The only way it can index those pages is via a regular crawling/spidering method. Please execute the following statement: select * from venexus_queue qhere queueurl = 'INSERT URL HERE'. Let me know if you get any results for your url.
Jeff SmithUser is Offline

Posts:210

Send A Message
04/02/2008 11:45 AM  
Also, which method are you using to aggregate? Please see: http://www.venexus.com/Support/ProductForums/tabid/132/forumid/2/postid/35/view/topic/Default.aspx
memondUser is Offline

Posts:19

Send A Message
04/02/2008 11:50 AM  
Hi I believe I am using the seamus module to agregate, I wasn't aware that any of the other ones (categories etc.) did that. Or do you mean other 3rd party agregators? I do have one that agregates rss feeds from another company. I also ran the sql statement and got the following, it looks ominus: System.Data.SqlClient.SqlException: Line 1: Incorrect syntax near 'queueurl'. at System.Data.SqlClient.SqlConnection.OnError(SqlException exception, Boolean breakConnection) at System.Data.SqlClient.SqlInternalConnection.OnError(SqlException exception, Boolean breakConnection) at System.Data.SqlClient.TdsParser.ThrowExceptionAndWarning(TdsParserStateObject stateObj) at System.Data.SqlClient.TdsParser.Run(RunBehavior runBehavior, SqlCommand cmdHandler, SqlDataReader dataStream, BulkCopySimpleResultSet bulkCopyHandler, TdsParserStateObject stateObj) at System.Data.SqlClient.SqlCommand.RunExecuteNonQueryTds(String methodName, Boolean async) at System.Data.SqlClient.SqlCommand.InternalExecuteNonQuery(DbAsyncResult result, String methodName, Boolean sendToPipe) at System.Data.SqlClient.SqlCommand.ExecuteNonQuery() at DotNetNuke.Data.SqlDataProvider.ExecuteADOScript(String SQL) at DotNetNuke.Data.SqlDataProvider.ExecuteScript(String Script, Boolean UseTransactions) select * from venexus_queue qhere queueurl = 'INSERT URL HERE'
memondUser is Offline

Posts:19

Send A Message
04/02/2008 12:00 PM  
Sorry, misread your last post. I am using the method where there is an instance of the seamus module on every page and when that page is visited it is added to the index, if I understand correctly. Should I try a different method? If so, which would you suggest? Thanks Mike
Jeff SmithUser is Offline

Posts:210

Send A Message
04/03/2008 9:54 AM  
I fat fingered that query. Please try this: select * from venexus_queue where queueurl = 'INSERT URL HERE'
memondUser is Offline

Posts:19

Send A Message
04/03/2008 1:48 PM  
Hi, I ran that query and I am getting a lot more search results now. I tested out searching pdfs, still no luck on that but I'm wondering if I have to wait until after the scheduled crawl to pick up those files? Thanks for getting me this far. Do I need to run that script every once and a while to update the search index or is it all set? Thanks Mike
Jeff SmithUser is Offline

Posts:210

Send A Message
04/04/2008 1:22 PM  
Are there pdf files in the queue? If so, they have to be crawled individually and if you have a lot of them it may take a while before they all are indexed, depending on your traffic. You may consider installing the DNN Scheduled Task we have in our downloads to speed this up.
Jeff SmithUser is Offline

Posts:210

Send A Message
04/04/2008 1:27 PM  
select * from venexus_queue where queueurl like '%.pdf'
memondUser is Offline

Posts:19

Send A Message
04/04/2008 3:14 PM  
I bet by now you are just dreading every time you see a post popping up from me :)
I ran the script and it went fine but didn't seem to pick up anything new that I browsed to / opened.
Also tried installing the scheduler but got the following:
StartJob Starting Installation
StartJob Reading files
Info Loading Venexus.Modules.VenexusScheduledTasks.dll
Info File Venexus.Modules.VenexusScheduledTasks.dll read successfully
Info Loading CreateScheduleTask.txt
Info File CreateScheduleTask.txt read successfully
Failure Did not find any .dnn file ???
Failure ExceptionSystem.Exception: File load failed, aborting at DotNetNuke.Modules.Admin.ResourceInstaller.PaInstaller.ReadZipStream() at DotNetNuke.Modules.Admin.ResourceInstaller.PaInstaller.Install()

Thanks
Mike
memondUser is Offline

Posts:19

Send A Message
04/04/2008 3:16 PM  
(I have about 3,000 pdf files - do I just need to be patient or start browsing all of these?)
Thanks
Mike
Jeff SmithUser is Offline

Posts:210

Send A Message
04/05/2008 10:56 AM  
When you said "I ran the script and it went fine but didn't seem to pick up anything new that I browsed to / opened. ", you mean there were lots of results? If so, this means these are results that have NOT been crawled....yet.

As for the VenexusScheduledTasks, it is not meant to be installed that way. Just drop the dll file into your bin folder and the text file has the instructions for setting up:
1. Upload Venexus.Modules.VenexusScheduledTasks.dll file to bin folder of your DNN installation.


2. Add this to the the web.config file between <appSettings>
<add key="SeamusWebRequestUrl" value="http://www.domainname.com/DesktopModules/VenexusSeamus/SeamusImage.aspx" />
*******************************************************************************************

Example (This is how it should look)

<appSettings>
<add key="SiteSqlServer" value="Server=ss2005;Database=DotNetNuke;uid=User;pwd=User!;" />
<add key="InstallTemplate" value="DotNetNuke.install.config" />
<add key="AutoUpgrade" value="true" />
<add key="UseInstallWizard" value="true" />
<add key="InstallMemberRole" value="true" />
<add key="ShowMissingKeys" value="false" />
<add key="EnableWebFarmSupport" value="false" />
<add key="EnableCachePersistence" value="false" />
<add key="HostHeader" value="" />
<add key="RemoveAngleBrackets" value="false" />
<add key="PersistentCookieTimeout" value="0" />
<add key="SeamusWebRequestUrl" value="http://www.domainname.com/DesktopModules/VenexusSeamus/SeamusImage.aspx" />
</appSettings>

*******************************************************************************************


Create a scheduled task under Host -> Scheduler

3. Add Item to Schedule

a) Full Class Name and Assembly: Venexus.Modules.VenexusScheduledTasks.SeamusWebRequest, Venexus.Modules.VenexusScheduledTasks
b) Schedule Enabled: Check Yes
c) Time Lapse: (2) Hour
d) Retry Frequency: (30) Minutes
e) Retain Schedule History: None
f) Catch Up Enabled: (Leave Blank)
g) Object Dependencies: (Leave Blank)
h) Run on Servers: (Leave Blank)
Jeff SmithUser is Offline

Posts:210

Send A Message
04/05/2008 10:59 AM  
Since you have so many pdf files, I would recommend lowering the Time Lapse to 5 minutes and Retry Frequency to 10 minutes.
memondUser is Offline

Posts:19

Send A Message
04/07/2008 12:11 PM  
Hi. I should have specified that I ran the script and it gave me a message that it ran without error. It didn't specify anything further though. I did install the scheduler as you instruct above and it is running successfully every 5 minutes now. So I'll give it some time and we'll see if anything improves with the search results. Thanks for your help.
Mike
Jeff SmithUser is Offline

Posts:210

Send A Message
04/08/2008 1:40 PM  

If you are still having trouble getting it to index those documents, please go into Admin > File Manager and Synchronize Files using the recursive option. Sometimes it takes a while for DNN to get those files all into the database and if they are not int here, Seamus will not find them. We have seen that when there are thousands of documents that DNN will fail to fully synchronize the filesyste, due to timeout errors. So, make sure all files appear in folder tree in your File Manager.

You are not authorized to post a reply.
Forums > Modules > Venexus Search > Must be missing something...



 Print   

Top Posts
VenexusBookmarks 4.1.2 Released by tmunn
A new version of  Venexus Bookmarks module is now available for downlo...
VenexusBookmarks 4.1.2 Released by tmunn
A new version of  Venexus Bookmarks module is now available for downlo...
VenexusSeamus 4.1.5 Released by tmunn
A new version of VenexusSeamus, the search engine aggregation module utiliz...
VenexusSeamus 4.1.5 Released by tmunn
A new version of VenexusSeamus, the search engine aggregation module utiliz...
Venexus WebControls 4.1.7 Released by tmunn
A new version of  Venexus WebControls was released today.  This r...
  

 © 2007 - Venexus, Inc. - All rights reserved Terms Of Use | Privacy Statement