Forums
Subject: Manage Domains: Delete vs. Inactive
Prev Next
You are not authorized to post a reply.

Author Messages
John HenleyUser is Offline

Posts:25

Send A Message
06/15/2008 5:43 PM  

What is the difference between deleting a domain and marking it not active?

John HenleyUser is Offline

Posts:25

Send A Message
06/15/2008 5:53 PM  

Actually, I mean delete vs. not approved. 

My assumption is that if I delete, it could be re-added in a future crawl, but if I uncheck 'approved' it will never be crawled again?

Jeff SmithUser is Online

Posts:210

Send A Message
06/16/2008 12:44 PM  
Deleting a domain, only removes it from the grid. It could be found again and added back, if you have any external links on your site. If global crawler is enabled in VenexusSeamus, it will not crawl any domains that have been marked inactive. Also, if you do not have global crawler enabled, and you add a domain and select Add to Queue, it will be crawled. I suggest you only delete if you removed the link from your site, and any you do not want crawled, mark them not active.
John HenleyUser is Offline

Posts:25

Send A Message
06/17/2008 8:18 AM  
Thanks. That's exactly what I needed to know. The domains are being added by the global crawler, so I wlll just mark them as unapproved.

BTW, when the global crawler adds a domain, sometimes it's added as approved and sometimes not. How is that determined, and is there a way to control it?
Jeff SmithUser is Online

Posts:210

Send A Message
06/17/2008 1:42 PM  

This is the code that adds the new domain:

                If IsNewDomain Then
                            Dim iNewDomainID As Integer = 0
                            Dim objNewDomain As New Venexus.BusinessObjects.VenexusDomain
                            objNewDomain.AddNew()
                            objNewDomain.DomainName = GetDomainName(sURL)
                            objNewDomain.LastRequest = Now()
                            If IsGlobalCrawler() = False Then
                                Dim objSeamus As New VenexusSeamusCollection
                                objSeamus.Query.Where(objSeamus.Query.Url.Equal(sURL))
                                If objSeamus.Query.Load Then
                                    objNewDomain.IsApproved = True
                                Else
                                    objNewDomain.IsApproved = False
                                End If
                        Else
                            objNewDomain.IsApproved = True
                        End If
                            objNewDomain.Save()
                            iNewDomainID = CType(objNewDomain.DomainID, Integer)
                            iDomainID = iNewDomainID
                    End If

                        Dim objNewRobots As New Venexus.BusinessObjects.VenexusRobots
                        objNewRobots.AddNew()
                        objNewRobots.DomainID = iDomainID
                        objNewRobots.LastUpdate = Now()
                        objNewRobots.RobotsText = sRobots
                        objNewRobots.Save()

If it is NOT a GloablCrawler, it checks to see if a feed exist. If it does exist, it marks it approved. If it does not, it marks in not approved.

If it is GlobalCrawler, then it is automatically approved, always. However, the robots.txt file is also checked. If the robots.txt disallows it to be crawled, it will automatically change it to NOT approved. Maybe your cases have a robots.txt file for the domain that disallows it to be crawled?

 

John HenleyUser is Offline

Posts:25

Send A Message
06/18/2008 9:45 PM  
The domains I'm wanting to disallow are not my own, like apple.com, google.com, etc. so I don't want to spider them. If I mark a domain as not approved, what happens to any existing, previously-spidered items for that domain--do they get removed?
Jeff SmithUser is Online

Posts:210

Send A Message
06/24/2008 1:40 PM  
with the new version they will get removed from queue and index.
You are not authorized to post a reply.
Forums > Modules > Venexus Seamus > Manage Domains: Delete vs. Inactive



 Print   

Top Posts
Venexus WebControls 4.1.7 Released by tmunn
A new version of  Venexus WebControls was released today.  This r...
Venexus WebControls 4.1.7 Released by tmunn
A new version of  Venexus WebControls was released today.  This r...
VenexusProfileSearch 4.1.7 Released by tmunn
The Venexus Profile Search module has been updated.  Make sure to upda...
VenexusProfileSearch 4.1.6 Released by tmunn
The Venexus Profile Search module has been updated.  Make sure to upda...
VenexusGroups 4.1.5 Released by tmunn
A new version of VenexusGroups has been released.  Make sure to update...
  

 © 2007 - Venexus, Inc. - All rights reserved Terms Of Use | Privacy Statement