We are still actively working on the spam issue.
Difference between revisions of "Search engines"
(→DuckDuckGo: fixed wrong template usage) |
(Add stub template, expand article) |
||
(28 intermediate revisions by 6 users not shown) | |||
Line 1: | Line 1: | ||
− | [[File: | + | {{cleanup|Does not provide an easy understanding of what a Search Engine is and instead provides external links regarding search engines}} |
+ | |||
+ | [[File:Snowden on Search Engines.png|thumb|[[Edward Snowden]] knows what's going on with Search Engines]] | ||
+ | |||
+ | A '''Search Engine''' is a form of web indexing which allows you to search by keyword to a database of saved websites. Prior to the rise of [[Google]], the first Search Engine, websites were found through categorized databases known as '''Linkbays'''. In the time since, Linkbays have died out and Search Engines are an expected part of the experience of the [[World Wide Web]]. | ||
+ | |||
+ | [[Google]], the original Search Engine, remains the most popular Search Engine on the net. However, Google (and many other websites) have begun to self-cripple their own services advanced features, including the ability to search by date and specifically by site. Google itself has (as of 2021) removed support for searching for websites farther-back than 5 years ago. DuckDuckGo no longer respects searching by date alltogether. It is believed that this is not only a push towards censorship, but also moving towards a model where Search Engines are sole arbitors of truth and only accept questions rather than keywords so answers can be provided by the Search Engine. This is an ongoing process and immediately observable phenomenon. | ||
+ | |||
+ | == Learn search syntax (search operators): == | ||
+ | * [https://help.duckduckgo.com/duckduckgo-help-pages/results/syntax/ DDG tutorial] | ||
+ | * [https://support.google.com/websearch/answer/2466433 G tutorial] | ||
+ | |||
+ | === Ddg's bangs are useless === | ||
+ | * [https://addons.mozilla.org/en-US/firefox/addon/add-custom-search-engine/ In -fox.] You can add most search engines easily without extensions, just enable search bar in your settings and click on the green circle whenever you feel like it, then go to settings and add a search shortcut. | ||
+ | * In vanilla chromium just right-click your adress bar. | ||
== What does /g/ use? == | == What does /g/ use? == | ||
− | * | + | /g/ doesn't use (((search engines))), search engine things that are worth mentioning: |
− | * | + | * [https://wiby.me/ wiby] |
− | * | + | * YaCy - p2p SE. |
− | * | + | * SearX - software-not-service metasearch engine. Detailed instances list: [https://searx.space/ searx.space] |
+ | Big regional search engines: | ||
+ | * Yahoo.co.jp | ||
+ | * Yandex - also better than Guugle at image searches! | ||
+ | * Naver | ||
+ | * Baidu | ||
== Google == | == Google == | ||
− | + | [[File:Googsearchtweaks.png|left|thumb|Google search tweaks. [http://insidesearch.blogspot.in/2012/08/an-update-to-our-search-algorithms.html]]] | |
[[Google]] is probably the most widely used search engine. However, many on /g/ fear that Google is a botnet, mainly due to the data harvesting the Chrome browser and Google search engine utilizes. Despite this, Google is still one of the best search engines, mainly due to its reverse image search feature. Other reverse image search engines such as TinyEye pale in comparison to Google’s. | [[Google]] is probably the most widely used search engine. However, many on /g/ fear that Google is a botnet, mainly due to the data harvesting the Chrome browser and Google search engine utilizes. Despite this, Google is still one of the best search engines, mainly due to its reverse image search feature. Other reverse image search engines such as TinyEye pale in comparison to Google’s. | ||
Line 20: | Line 39: | ||
* Privacy issues | * Privacy issues | ||
− | ** That Google tracks user searches and online behavior is no secret. What makes this worse is the fact that Google often shares this information with governments which request it. In Google’s defence, it must be said that they usually follow the law, and do not comply with requests which do not meet the law. Please refer to the [ | + | ** That Google tracks user searches and online behavior is no secret. What makes this worse is the fact that Google often shares this information with governments which request it. In Google’s defence, it must be said that they usually follow the law, and do not comply with requests which do not meet the law. Please refer to the [https://www.seordp.org/] for more information. |
− | ** Besides, Google has often shown itself to be opposed to online anonymity, and privacy in general. Its CEO Eric Schmidt has had plenty of [[Wikipedia:Eric_Schmidt#Views |controversial statements]] in the past. One example: | + | ** Besides, Google has often shown itself to be opposed to online anonymity, and privacy in general. Its CEO Eric Schmidt has had plenty of [[Wikipedia:Eric_Schmidt#Views |controversial statements]] in the past. One example: “I think judgment matters. If you have something that you don’t want anyone to know, ''maybe you shouldn’t be doing it in the first place'', but if you really need that kind of privacy, the reality is that search engines including Google do retain this information for some time, and it’s important, for example, that we are all subject in the United States to the [[Wikipedia:USA_PATRIOT_Act |Patriot Act]]. It is possible that, that information could be made available to the authorities” - when asked whether people should treat Google like a trusted friend. |
== DuckDuckGo == | == DuckDuckGo == | ||
Line 35: | Line 54: | ||
== ixquick == | == ixquick == | ||
− | [https://ixquick.com/ ixquick] | + | [https://ixquick.com/ ixquick] was a meta search engine with focus on privacy. It was merged with startpage. |
== MetaGer == | == MetaGer == | ||
''experimental English support'' | ''experimental English support'' | ||
− | [https://metager.de/en/ MetaGer] was created by angry Germans, who don't want the NSA to know they're looking for Sauerkraut. Like | + | [https://metager.de/en/ MetaGer] was created by angry Germans, who don't want the NSA to know they're looking for Sauerkraut. Like startpage, it is a meta search engine focused on privacy. Its results come from Bing. Its income is through ads, served based on search terms. |
== Bing == | == Bing == | ||
Line 50: | Line 69: | ||
== Startpage == | == Startpage == | ||
− | [https://startpage.com/ Startpage] is not a new search engine per se. Rather, it takes your search query, and returns anonymized Google search results. This way, you get Google search results, but Google doesn’t get to know who you are. Startpage can also be combined with the Ixquick proxy. On the Startpage search results page, a ‘View by Ixquick Proxy’ option can be used to visit the search result with a proxy. Startpage has SSL and HTTPS add-ons for Mozilla Firefox. | + | [https://startpage.com/ Startpage] is not a new search engine per se. Rather, it takes your search query, and returns anonymized Google search results. This way, you get Google search results, but Google doesn’t get to know who you are. Startpage can also be combined with the Ixquick proxy. On the Startpage search results page, a ‘View by Ixquick Proxy’ option can be used to visit the search result with a proxy. Startpage has SSL and HTTPS add-ons for Mozilla Firefox. Note that Startpage is partially owned by an [https://restoreprivacy.com/startpage-system1-privacy-one-group/ advertisement company] |
+ | |||
+ | ==== Setting Startpage as a search engine ==== | ||
+ | What is given to you by Startpage's website won't work, so use this link in the third box when adding it as a search engine: https://startpage.com/do/search?query=%s&cat=web&pl=chrome&language=english | ||
+ | Alternatively, you would be better off using a locally hosted page. | ||
== Searx == | == Searx == | ||
− | [https://searx.me/ Searx] is an | + | [https://searx.me/ Searx] is an open source [https://en.wikipedia.org/wiki/Metasearch_engine metasearch engine]. |
− | It returns anonymized results from | + | It returns anonymized results from other search engines like google, bing or startpage without tracking its users. |
− | Searx queries are made using [http://www.w3schools.com/tags/ref_httpmethods.asp POST requests] so that they don't show up on logs or search history. On top of that Searx is [https://github.com/asciimoo/searx | + | Searx queries are made using [http://www.w3schools.com/tags/ref_httpmethods.asp POST requests] so that they don't show up on logs or search history. On top of that Searx is [https://github.com/asciimoo/searx open source] and self-hostable. It has a hidden service. Here is a list of [https://searx.space instances] [https://asciimoo.github.io/searx/user/own-instance.html Public instances] could possibly become "rogue" and log user activity, similar to Tor nodes being hijacked. Also note that Searx has been blacklisted by both Google, and performance results may vary. |
− | It | ||
[[Category:Recommendations]] | [[Category:Recommendations]] | ||
− | [[Category:What does /g/ use?]]] | + | [[Category:What does /g/ use?]] |
+ | |||
+ | == YaCy == | ||
+ | |||
+ | [https://yacy.net/ YaCy] is one of the oldest P2P search engines, making it completely decentralized without a central log server. | ||
+ | |||
+ | = Experimental search engines = | ||
+ | |||
+ | * [https://www.yippy.com Yippy] - metasearch engine that groups results in clusters. | ||
+ | |||
+ | = Lurk more = | ||
+ | |||
+ | * https://www.searchenginefinder.com |
Latest revision as of 06:20, 20 February 2022
A Search Engine is a form of web indexing which allows you to search by keyword to a database of saved websites. Prior to the rise of Google, the first Search Engine, websites were found through categorized databases known as Linkbays. In the time since, Linkbays have died out and Search Engines are an expected part of the experience of the World Wide Web.
Google, the original Search Engine, remains the most popular Search Engine on the net. However, Google (and many other websites) have begun to self-cripple their own services advanced features, including the ability to search by date and specifically by site. Google itself has (as of 2021) removed support for searching for websites farther-back than 5 years ago. DuckDuckGo no longer respects searching by date alltogether. It is believed that this is not only a push towards censorship, but also moving towards a model where Search Engines are sole arbitors of truth and only accept questions rather than keywords so answers can be provided by the Search Engine. This is an ongoing process and immediately observable phenomenon.
Contents
Learn search syntax (search operators):
Ddg's bangs are useless
- In -fox. You can add most search engines easily without extensions, just enable search bar in your settings and click on the green circle whenever you feel like it, then go to settings and add a search shortcut.
- In vanilla chromium just right-click your adress bar.
What does /g/ use?
/g/ doesn't use (((search engines))), search engine things that are worth mentioning:
- wiby
- YaCy - p2p SE.
- SearX - software-not-service metasearch engine. Detailed instances list: searx.space
Big regional search engines:
- Yahoo.co.jp
- Yandex - also better than Guugle at image searches!
- Naver
- Baidu
Google is probably the most widely used search engine. However, many on /g/ fear that Google is a botnet, mainly due to the data harvesting the Chrome browser and Google search engine utilizes. Despite this, Google is still one of the best search engines, mainly due to its reverse image search feature. Other reverse image search engines such as TinyEye pale in comparison to Google’s.
Criticism for Google falls under the following main categories:
- Censorship of results
- Google has often adopted positions which have pissed off /g/entoomen. These range from pushing down search results which link to sites which have received DMCA notices, to making the SafeSearch mandatory to all image searches.
- Tailored results
- While many people think getting tailored search results is a wonderful thing, rms has this to say on the topic: “I find Google’s argument,”The better to serve you with my dear," to be an insult to our intelligence." Sauce
- Privacy issues
- That Google tracks user searches and online behavior is no secret. What makes this worse is the fact that Google often shares this information with governments which request it. In Google’s defence, it must be said that they usually follow the law, and do not comply with requests which do not meet the law. Please refer to the [2] for more information.
- Besides, Google has often shown itself to be opposed to online anonymity, and privacy in general. Its CEO Eric Schmidt has had plenty of controversial statements in the past. One example: “I think judgment matters. If you have something that you don’t want anyone to know, maybe you shouldn’t be doing it in the first place, but if you really need that kind of privacy, the reality is that search engines including Google do retain this information for some time, and it’s important, for example, that we are all subject in the United States to the Patriot Act. It is possible that, that information could be made available to the authorities” - when asked whether people should treat Google like a trusted friend.
DuckDuckGo
DuckDuckGo is the go-to search engine for people fearing for their privacy from larger engines such as Google. It is preferred by some people because it respects your freedom, meaning it doesn’t track your search history and doesn’t bubble you for personalized searches and advertisements. DDG, as fans often call it, also has very cool search features – called Goodies – like math, programming, music, cryptography among others.
DuckDuckGo has integration into several other search engines with the !bang syntax. Examples includes StartPage and StartPage Images, which you can search by prefixing your query with !s or !spi respectively.
Another reason anons like DDG is that it has ads for 4chan which helps support the cash-strapped website.
The 8chan /tech/ board lists a number of reasons to at least be suspicious about DuckDuckGo. However, RMS uses it when he needs to search something, and the people who represent DuckDuckGo claim that the reasons listed were either mistakes, or irrelevant.
ixquick
ixquick was a meta search engine with focus on privacy. It was merged with startpage.
MetaGer
experimental English support
MetaGer was created by angry Germans, who don't want the NSA to know they're looking for Sauerkraut. Like startpage, it is a meta search engine focused on privacy. Its results come from Bing. Its income is through ads, served based on search terms.
Bing
Bing is generally considered to be right on par with Google, albeit far less popular in usage. Lately, however, anons have started preferring Bing over Google when Google started censoring adult content even for explicit search terms.
If you think you can trust Micro$hit over Google, please discontinue reading /g/ and fucking kill yourself.
Startpage
Startpage is not a new search engine per se. Rather, it takes your search query, and returns anonymized Google search results. This way, you get Google search results, but Google doesn’t get to know who you are. Startpage can also be combined with the Ixquick proxy. On the Startpage search results page, a ‘View by Ixquick Proxy’ option can be used to visit the search result with a proxy. Startpage has SSL and HTTPS add-ons for Mozilla Firefox. Note that Startpage is partially owned by an advertisement company
Setting Startpage as a search engine
What is given to you by Startpage's website won't work, so use this link in the third box when adding it as a search engine: https://startpage.com/do/search?query=%s&cat=web&pl=chrome&language=english Alternatively, you would be better off using a locally hosted page.
Searx
Searx is an open source metasearch engine. It returns anonymized results from other search engines like google, bing or startpage without tracking its users. Searx queries are made using POST requests so that they don't show up on logs or search history. On top of that Searx is open source and self-hostable. It has a hidden service. Here is a list of instances Public instances could possibly become "rogue" and log user activity, similar to Tor nodes being hijacked. Also note that Searx has been blacklisted by both Google, and performance results may vary.
YaCy
YaCy is one of the oldest P2P search engines, making it completely decentralized without a central log server.
Experimental search engines
- Yippy - metasearch engine that groups results in clusters.