Class URLFrontierGrpc.URLFrontierFutureStub

    • Method Detail

      • listNodes

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.StringList> listNodes​(Urlfrontier.Empty request)
         * Return the list of nodes forming the cluster the current node belongs to *
         
      • listCrawls

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.StringList> listCrawls​(Urlfrontier.Local request)
         * Return the list of crawls handled by the frontier(s) *
         
      • listQueues

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.QueueList> listQueues​(Urlfrontier.Pagination request)
         * Return a list of queues for a specific crawl. Can chose whether to include inactive queues (a queue is active if it has URLs due for fetching);
         by default the service will return up to 100 results from offset 0 and exclude inactive queues.*
         
      • getStats

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.Stats> getStats​(Urlfrontier.QueueWithinCrawlParams request)
         * Return stats for a specific queue or an entire crawl. Does not aggregate the stats across different crawlids. *
         
      • blockQueueUntil

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.Empty> blockQueueUntil​(Urlfrontier.BlockQueueParams request)
         * Block a queue from sending URLs; the argument is the number of seconds of UTC time since Unix epoch
         1970-01-01T00:00:00Z. The default value of 0 will unblock the queue. The block will get removed once the time
         indicated in argument is reached. This is useful for cases where a server returns a Retry-After for instance.
         
      • setActive

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.Empty> setActive​(Urlfrontier.Active request)
         * De/activate the crawl. GetURLs will not return anything until SetActive is set to true. PutURLs will still take incoming data. *
         
      • getActive

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.Boolean> getActive​(Urlfrontier.Local request)
         * Returns true if the crawl is active, false if it has been deactivated with SetActive(Boolean) *
         
      • setDelay

        public com.google.common.util.concurrent.ListenableFuture<Urlfrontier.Empty> setDelay​(Urlfrontier.QueueDelayParams request)
         * Set a delay from a given queue.
         No URLs will be obtained via GetURLs for this queue until the number of seconds specified has
         elapsed since the last time URLs were retrieved.
         Usually informed by the delay setting of robots.txt.