
Difference between revisions of "Search engines"

From InstallGentoo Wiki
(Added info about !bang syntax)
(Add stub template, expand article)
 
(49 intermediate revisions by 18 users not shown)
Line 1: Line 1:
__TOC__
+
{{cleanup|Does not provide an easy understanding of what a Search Engine is and instead provides external links regarding search engines}}
  
[[File:Googsearchtweaks.png|thumb|Google search tweaks. [http://insidesearch.blogspot.in/2012/08/an-update-to-our-search-algorithms.html]]]
+
[[File:Snowden on Search Engines.png|thumb|[[Edward Snowden]] knows what's going on with Search Engines]]
  
= Google =
+
A '''Search Engine''' is a form of web indexing that lets you search a database of saved websites by keyword. Prior to the rise of Search Engines such as [[Google]], websites were found through categorized databases known as '''Linkbays'''. In the time since, Linkbays have died out and Search Engines have become an expected part of the [[World Wide Web]] experience.
  
Google is probably the most widely used search engine. However, many on /g/ fear that Google is a botnet, mainly due to the data harvesting the Chrome browser and Google search engine utilizes. Despite this, Google is still one of the best search engines, mainly due to its reverse image search feature. Other reverse image search engines such as TinyEye pale in comparison to Google’s.
+
[[Google]] remains the most popular Search Engine on the net. However, Google (like many other websites) has begun to cripple its own services' advanced features, including the ability to search by date and by specific site. Google itself has (as of 2021) removed support for searching for websites further back than 5 years ago. DuckDuckGo no longer respects date searches at all. It is believed that this is not only a push towards censorship, but also a move towards a model where Search Engines are the sole arbiters of truth and accept only questions rather than keywords, so that answers can be provided by the Search Engine itself. This is an ongoing process and an immediately observable phenomenon.
 +
 
 +
== Learn search syntax (search operators): ==
 +
* [https://help.duckduckgo.com/duckduckgo-help-pages/results/syntax/ DDG tutorial]
 +
* [https://support.google.com/websearch/answer/2466433 G tutorial]
 +
 
 +
=== Ddg's bangs are useless ===
 +
* [https://addons.mozilla.org/en-US/firefox/addon/add-custom-search-engine/ In -fox.] You can add most search engines easily without extensions, just enable search bar in your settings and click on the green circle whenever you feel like it, then go to settings and add a search shortcut.
 +
* In vanilla chromium just right-click your address bar.
 +
 
 +
== What does /g/ use? ==
 +
/g/ doesn't use mainstream search engines. Search tools that are worth mentioning:
 +
* [https://wiby.me/ wiby]
 +
* YaCy - p2p SE.
 +
* SearX - software-not-service metasearch engine. Detailed instances list: [https://searx.space/ searx.space]
 +
Big regional search engines:
 +
* Yahoo.co.jp
 +
* Yandex - also better than Guugle at image searches!
 +
* Naver
 +
* Baidu
 +
 
 +
== Google ==
 +
[[File:Googsearchtweaks.png|left|thumb|Google search tweaks. [http://insidesearch.blogspot.in/2012/08/an-update-to-our-search-algorithms.html]]]
 +
[[Google]] is probably the most widely used search engine. However, many on /g/ fear that Google is a botnet, mainly due to the data harvesting that the Chrome browser and Google search engine perform. Despite this, Google is still one of the best search engines, mainly due to its reverse image search feature. Other reverse image search engines such as TinEye pale in comparison to Google’s.
  
 
Criticism for Google falls under the following main categories:
 
Criticism for Google falls under the following main categories:
  
<ul>
+
* Censorship of results
<li><p>Censorship of results</p>
+
** Google has often adopted positions which have pissed off /g/entoomen. These range from pushing down search results which link to [http://insidesearch.blogspot.in/2012/08/an-update-to-our-search-algorithms.html sites which have received DMCA notices,] to making the [https://www.youtube.com/watch?v=yuqPeNtO7xI SafeSearch mandatory to all image searches.]
<p>Google has often adopted positions which have pissed off /g/entoomen. These range from pushing down search results which link to [http://insidesearch.blogspot.in/2012/08/an-update-to-our-search-algorithms.html sites which have received DMCA notices,] to making the [https://www.youtube.com/watch?v=yuqPeNtO7xI SafeSearch mandatory to all image searches.]</p></li>
 
<li><p>Tailored results</p>
 
<p>While many people think getting tailored search results is a wonderful thing, rms has this to say on the topic:</p>
 
<p>“I find Google’s argument,”The better to serve you with my dear,&quot; to be an insult to our intelligence.&quot; [http://stallman.org/cgi-bin/showpage.cgi?path=/archives/2012-jan-apr.html&term=Google&type=norm&case=0 Sauce]</p></li>
 
<li><p>Privacy issues</p>
 
<p>That Google tracks user searches and online behavior is no secret. What makes this worse is the fact that Google often shares this information with governments which request it. In Google’s defence, it must be said that they usually follow the law, and do not comply with requests which do not meet the law. Please refer to the [http://www.google.com/transparencyreport/ Google Transparency Report] for more information. Besides, Google has often shown itself to be opposed to online anonymity, and privacy in general. Its CEO Eric Schmidt has had plenty of [https://en.wikipedia.org/wiki/Eric_Schmidt#Views controversial statements] in the past. One example: '''“I think judgment matters. If you have something that you don’t want anyone to know, ''maybe you shouldn’t be doing it in the first place'', but if you really need that kind of privacy, the reality is that search engines including Google do retain this information for some time, and it’s important, for example, that we are all subject in the United States to the [https://en.wikipedia.org/wiki/USA_PATRIOT_Act Patriot Act.] It is possible that, that information could be made available to the authorities”''' - when asked whether people should treat Google like a trusted friend.</p></li></ul>
 
  
= [https://ddg.gg/ DuckDuckGo] =
+
* Tailored results
 +
** While many people think getting tailored search results is a wonderful thing, rms has this to say on the topic: “I find Google’s argument,”The better to serve you with my dear,&quot; to be an insult to our intelligence.&quot; [http://stallman.org/cgi-bin/showpage.cgi?path=/archives/2012-jan-apr.html&term=Google&type=norm&case=0 Sauce]
  
DuckDuckGo is the go-to search engine for people fearing for their privacy from larger engines such as Google. It is preferred by some people because it respects your freedom, meaning it doesn’t track your search history and doesn’t bubble you for personalized searches and advertisements. DDG, as fans often call it, also has very cool search features - called [https://duckduckgo.com/goodies Goodies] - like math, programming, music, cryptography among others. DuckDuckGo can however be difficult to find results for slightly obscure keywords.
+
* Privacy issues
 +
** That Google tracks user searches and online behavior is no secret. What makes this worse is the fact that Google often shares this information with governments which request it. In Google’s defence, it must be said that they usually follow the law, and do not comply with requests which do not meet the law. Please refer to the [https://www.seordp.org/] for more information.
 +
** Besides, Google has often shown itself to be opposed to online anonymity, and privacy in general. Its CEO Eric Schmidt has had plenty of [[Wikipedia:Eric_Schmidt#Views |controversial statements]] in the past. One example: “I think judgment matters. If you have something that you don’t want anyone to know, ''maybe you shouldn’t be doing it in the first place'', but if you really need that kind of privacy, the reality is that search engines including Google do retain this information for some time, and it’s important, for example, that we are all subject in the United States to the [[Wikipedia:USA_PATRIOT_Act |Patriot Act]]. It is possible that, that information could be made available to the authorities” - when asked whether people should treat Google like a trusted friend.
 +
 
 +
== DuckDuckGo ==
 +
 
 +
[https://www.duckduckgo.com/html/ DuckDuckGo] is the go-to search engine for people fearing for their privacy from larger engines such as Google. It is preferred by some people because it respects your freedom, meaning it doesn’t track your search history and doesn’t bubble you for personalized searches and advertisements. DDG, as fans often call it, also has very cool search features&nbsp;&ndash;&nbsp;called [https://duckduckgo.com/goodies Goodies]&nbsp;&ndash;&nbsp;like math, programming, music, cryptography among others.
  
 
DuckDuckGo has integration into several other search engines with the [https://duckduckgo.com/bang.html !bang syntax]. Examples includes StartPage and StartPage Images, which you can search by prefixing your query with !s or !spi respectively.  
 
DuckDuckGo has integration into several other search engines with the [https://duckduckgo.com/bang.html !bang syntax]. Examples includes StartPage and StartPage Images, which you can search by prefixing your query with !s or !spi respectively.  
  
Another reason anons like DDG is that it has ads on 4chan which help support the cash-strapped website.
+
Another reason anons like DDG is that it has ads for 4chan which helps support the cash-strapped website.
 +
 
 +
The [[8chan]] [[/tech/]] board lists a number of [https://8ch.net/tech/ddg.html reasons] to at least be ''suspicious'' about DuckDuckGo.  However, [[Richard Mathew Stallman|RMS]] uses it when he needs to search something, and the people who represent DuckDuckGo claim that the reasons listed were either mistakes, or irrelevant.
 +
 
 +
== ixquick ==
 +
 
 +
[https://ixquick.com/  ixquick] was a meta search engine with focus on privacy. It was merged with startpage.
 +
 
 +
== MetaGer ==
 +
''experimental English support''
 +
 
 +
[https://metager.de/en/ MetaGer] was created by angry Germans, who don't want the NSA to know they're looking for Sauerkraut. Like startpage, it is a meta search engine focused on privacy. Its results come from Bing. Its income is through ads, served based on search terms.
 +
 
 +
== Bing ==
 +
 
 +
[https://www.bing.com/ Bing] is generally considered to be right on par with Google, albeit far less popular in usage. Lately, however, anons have started preferring Bing over Google when Google started censoring adult content even for explicit search terms.
 +
 
 +
If you think you can trust Microsoft any more than Google, think again.
 +
 
 +
== Startpage ==
 +
 
 +
[https://startpage.com/ Startpage] is not a new search engine per se. Rather, it takes your search query, and returns anonymized Google search results. This way, you get Google search results, but Google doesn’t get to know who you are. Startpage can also be combined with the Ixquick proxy. On the Startpage search results page, a ‘View by Ixquick Proxy’ option can be used to visit the search result with a proxy. Startpage has SSL and HTTPS add-ons for Mozilla Firefox. Note that Startpage is partially owned by an [https://restoreprivacy.com/startpage-system1-privacy-one-group/ advertisement company]
 +
 
 +
==== Setting Startpage as a search engine ====
 +
What is given to you by Startpage's website won't work, so use this link in the third box when adding it as a search engine: https://startpage.com/do/search?query=%s&cat=web&pl=chrome&language=english
 +
Alternatively, you would be better off using a locally hosted page.
 +
 
 +
== Searx ==
 +
 
 +
[https://searx.me/ Searx] is an open source [https://en.wikipedia.org/wiki/Metasearch_engine metasearch engine].
 +
It returns anonymized results from other search engines like google, bing or startpage without tracking its users.
 +
Searx queries are made using [http://www.w3schools.com/tags/ref_httpmethods.asp POST requests] so that they don't show up in logs or search history. On top of that, Searx is [https://github.com/asciimoo/searx open source] and self-hostable. It has a hidden service. Here is a list of [https://searx.space instances]. [https://asciimoo.github.io/searx/user/own-instance.html Public instances] could possibly become "rogue" and log user activity, similar to Tor nodes being hijacked. Also note that Searx has been blacklisted by Google, and results may vary.
 +
 
 +
[[Category:Recommendations]]
 +
[[Category:What does /g/ use?]]
 +
 
 +
== YaCy ==
 +
 
 +
[https://yacy.net/ YaCy] is one of the oldest P2P search engines, making it completely decentralized without a central log server.
  
= Bing =
+
= Experimental search engines =
  
Bing is generally considered to be right on par with Google. Lately, however, anons have started preferring Bing over Google when Google started censoring adult content even for explicit search terms.
+
* [https://www.yippy.com Yippy] - metasearch engine that groups results in clusters.
  
= Startpage =
+
= Lurk more =
  
Startpage is not a new search engine into itself. Rather, it takes your search query, and returns anonymized Google search results. This way, you get Google search results, but Google doesn’t get to know who you are. Startpage can also be combined with the Ixquick proxy. On the Startpage search results page, a ‘View by Ixquick Proxy’ option can be used to visit the search result with a proxy. Startpage has SSL and HTTPS add-ons for Mozilla Firefox.
+
* https://www.searchenginefinder.com

Latest revision as of 06:20, 20 February 2022

CLEANUP CANDIDATE
Relevant discussion may be found on the talk page. Reason: Does not provide an easy understanding of what a Search Engine is and instead provides external links regarding search engines


Edward Snowden knows what's going on with Search Engines

A Search Engine is a form of web indexing that lets you search a database of saved websites by keyword. Prior to the rise of Search Engines such as Google, websites were found through categorized databases known as Linkbays. In the time since, Linkbays have died out and Search Engines have become an expected part of the World Wide Web experience.

Google remains the most popular Search Engine on the net. However, Google (like many other websites) has begun to cripple its own services' advanced features, including the ability to search by date and by specific site. Google itself has (as of 2021) removed support for searching for websites further back than 5 years ago. DuckDuckGo no longer respects date searches at all. It is believed that this is not only a push towards censorship, but also a move towards a model where Search Engines are the sole arbiters of truth and accept only questions rather than keywords, so that answers can be provided by the Search Engine itself. This is an ongoing process and an immediately observable phenomenon.

Learn search syntax (search operators):

Ddg's bangs are useless

  • In -fox: you can add most search engines easily without extensions. Just enable the search bar in your settings and click on the green circle whenever you feel like it, then go to settings and add a search shortcut.
  • In vanilla chromium just right-click your address bar.

What does /g/ use?

/g/ doesn't use mainstream search engines. Search tools that are worth mentioning:

  • wiby
  • YaCy - p2p SE.
  • SearX - software-not-service metasearch engine. Detailed instances list: searx.space

Big regional search engines:

  • Yahoo.co.jp
  • Yandex - also better than Guugle at image searches!
  • Naver
  • Baidu

Google

Google search tweaks. [1]

Google is probably the most widely used search engine. However, many on /g/ fear that Google is a botnet, mainly due to the data harvesting that the Chrome browser and Google search engine perform. Despite this, Google is still one of the best search engines, mainly due to its reverse image search feature. Other reverse image search engines such as TinEye pale in comparison to Google’s.

Criticism for Google falls under the following main categories:

  • Tailored results
    • While many people think getting tailored search results is a wonderful thing, rms has this to say on the topic: “I find Google’s argument, "The better to serve you with my dear," to be an insult to our intelligence.” Sauce
  • Privacy issues
    • That Google tracks user searches and online behavior is no secret. What makes this worse is the fact that Google often shares this information with governments which request it. In Google’s defence, it must be said that they usually follow the law, and do not comply with requests that lack a legal basis. Please refer to the Google Transparency Report for more information.
    • Besides, Google has often shown itself to be opposed to online anonymity, and privacy in general. Its former CEO Eric Schmidt has made plenty of controversial statements in the past. One example: “I think judgment matters. If you have something that you don’t want anyone to know, maybe you shouldn’t be doing it in the first place, but if you really need that kind of privacy, the reality is that search engines including Google do retain this information for some time, and it’s important, for example, that we are all subject in the United States to the Patriot Act. It is possible that, that information could be made available to the authorities” - when asked whether people should treat Google like a trusted friend.

DuckDuckGo

DuckDuckGo is the go-to search engine for people who fear for their privacy when using larger engines such as Google. It is preferred by some people because it respects your freedom, meaning it doesn’t track your search history and doesn’t bubble you for personalized searches and advertisements. DDG, as fans often call it, also has very cool search features – called Goodies – for math, programming, music, and cryptography, among others.

DuckDuckGo integrates with several other search engines through the !bang syntax. Examples include StartPage and StartPage Images, which you can search by prefixing your query with !s or !spi respectively.
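As a rough sketch of what a bang query looks like in practice, the snippet below prefixes a query with a bang and builds the resulting search URL. It assumes DuckDuckGo accepts the query in a `q` URL parameter; the helper name `ddg_bang_url` is made up for this illustration and is not part of any DuckDuckGo tooling.

```python
from urllib.parse import urlencode


def ddg_bang_url(bang: str, query: str) -> str:
    """Build a DuckDuckGo URL whose query starts with a !bang (hypothetical helper)."""
    # A bang is just a prefix on the query text, e.g. "!s gentoo install".
    # Accept the bang with or without its leading "!".
    full_query = f"!{bang.lstrip('!')} {query}"
    # Assumption: the search terms travel in the "q" parameter.
    return "https://duckduckgo.com/?" + urlencode({"q": full_query})


if __name__ == "__main__":
    print(ddg_bang_url("s", "gentoo install"))  # StartPage web search
    print(ddg_bang_url("!spi", "tux"))          # StartPage image search
```

The "!" ends up percent-encoded as `%21` in the URL; typing the bang directly into the search box has the same effect.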

Another reason anons like DDG is that it runs ads on 4chan, which helps support the cash-strapped website.

The 8chan /tech/ board lists a number of reasons to at least be suspicious about DuckDuckGo. However, RMS uses it when he needs to search for something, and the people who represent DuckDuckGo claim that the reasons listed were either mistakes or irrelevant.

ixquick

ixquick was a meta search engine with a focus on privacy. It was merged into Startpage.

MetaGer

experimental English support

MetaGer was created by angry Germans who don't want the NSA to know they're looking for Sauerkraut. Like Startpage, it is a meta search engine focused on privacy. Its results come from Bing. Its income comes from ads, served based on search terms.

Bing

Bing is generally considered to be right on par with Google, albeit far less popular. Lately, however, anons have started preferring Bing over Google, since Google started censoring adult content even for explicit search terms.

If you think you can trust Microsoft any more than Google, think again.

Startpage

Startpage is not a new search engine per se. Rather, it takes your search query and returns anonymized Google search results. This way, you get Google search results, but Google doesn’t get to know who you are. Startpage can also be combined with the Ixquick proxy. On the Startpage search results page, a ‘View by Ixquick Proxy’ option can be used to visit the search result with a proxy. Startpage has SSL and HTTPS add-ons for Mozilla Firefox. Note that Startpage is partially owned by an advertising company.

Setting Startpage as a search engine

The search URL that Startpage's website gives you won't work, so use this one in the third box when adding it as a search engine:

https://startpage.com/do/search?query=%s&cat=web&pl=chrome&language=english

Alternatively, you would be better off using a locally hosted page.
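To illustrate what the browser does with a search URL like this, here is a minimal sketch of how the `%s` placeholder gets filled in. The function name `expand_template` is hypothetical, and real browsers may encode spaces differently (for example as `%20` rather than `+`), so treat this as an approximation.

```python
from urllib.parse import quote_plus

# The template from this section; %s marks where the search terms go.
TEMPLATE = "https://startpage.com/do/search?query=%s&cat=web&pl=chrome&language=english"


def expand_template(template: str, terms: str) -> str:
    """Substitute URL-encoded search terms for the %s placeholder (hypothetical helper)."""
    # Encode the terms so spaces and symbols are safe inside a URL,
    # then replace only the first %s occurrence.
    return template.replace("%s", quote_plus(terms), 1)


if __name__ == "__main__":
    print(expand_template(TEMPLATE, "install gentoo"))
```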

Searx

Searx is an open source metasearch engine. It returns anonymized results from other search engines like Google, Bing or Startpage without tracking its users. Searx queries are made using POST requests so that they don't show up in logs or search history. Searx is also self-hostable and has a hidden service. Here is a list of instances. Public instances could possibly become "rogue" and log user activity, similar to Tor nodes being hijacked. Also note that Searx has been blacklisted by Google, and results may vary.
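The point about POST requests can be sketched as follows: with POST, the search terms travel in the request body rather than in the URL, so the URL that ends up in history or access logs does not contain them. The instance address `https://searx.example.org` is a placeholder, and the `/search` path and `q` field are assumptions about a typical instance; the snippet only builds the request and never sends it.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_search_request(instance_url: str, query: str) -> Request:
    """Build (but do not send) a POST search request (hypothetical helper)."""
    # The query goes into the form-encoded body, not the URL.
    body = urlencode({"q": query}).encode("ascii")
    # Assumption: the instance exposes its search endpoint at /search.
    return Request(instance_url.rstrip("/") + "/search", data=body, method="POST")


if __name__ == "__main__":
    req = build_search_request("https://searx.example.org", "yacy p2p")
    print(req.get_method(), req.full_url)  # the URL carries no search terms
    print(req.data)                        # the terms live in the body
```

Note that a rogue instance operator can still read the request body, which is why the caveat about public instances applies regardless of the HTTP method.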

YaCy

YaCy is one of the oldest P2P search engines. Being peer-to-peer, it is completely decentralized, with no central log server.

Experimental search engines

  • Yippy - metasearch engine that groups results in clusters.

Lurk more