发现Url rewriter是个好东西啊,blog也使用了这个东西(转一篇文章研究一个)
2004-06-02 09:48
841 查看
Download source - 8 Kb
As more and more websites are being rewritten with ASP.NET, the old sites which had been indexed by google and linked from other sites are lost, inevitably culminating in the dreaded 404 error. I will show how legacy ASP sites can be upgraded to ASP.NET, while maintaining links from search engines.
The storage location for all ASP.NET Configuration information is the
This section handler specifies that for every section called "urlrewrites", there is a class called
A section handler is defined as a class which implements the IConfigurationSectionHandler interface. This has one method,
None will actually work because there's one last thing we have to be aware of...
We've added the HEAD, GET and POST verbs to all files with .* file extension (ie all files) and mapped those to the ASP.NET ISAPI extension - aspnet_isapi.dll.
The complete range of mappings, including the new .* mapping.
Introduction
One of the most popular extensions to the Apache webserver has beenmod_rewrite- a filter which rewrites URLs. For example, instead of a URL such as
http://www.apache.org/BookDetails.pl?id=5you could provide a filter which accepts URLs such as
http://www.apache.org/Book/5.htmland it will silently perform a server-side redirect to the first URL. In this way, the real URL could be hidden, providing an obfuscated facade to the web page. The benefits are easier to remember URLs and increasing the difficulty of hacking a website.
Mod_rewritebecame very popular and grew to encompass a couple of other features not related to URL Rewriting, such as caching. This article demonstrates URL Rewriting with ASP.NET, whereby the requested URL is matched based on a regular expression and the URL mappings are stored in the standard ASP.NET
web.configconfiguration file. ASP.NET includes great caching facilities, so there's no need to duplicate
mod_rewrite's caching functionality.
As more and more websites are being rewritten with ASP.NET, the old sites which had been indexed by google and linked from other sites are lost, inevitably culminating in the dreaded 404 error. I will show how legacy ASP sites can be upgraded to ASP.NET, while maintaining links from search engines.
ASP.NET support for URL Rewriting
ASP.NET provides very limited support out of the box. In fact, it's support is down to a single method:void HttpContext.RewritePath(string path)which should be called during the
Application_BeginRequest()event in the
Global.asaxfile. This is fine as long as the number of URLs to rewrite is a small, finite, managable number. However most ASP sites are in some way dynamic, passing parameters in the Query String, so we require a much more configurable approach.
The storage location for all ASP.NET Configuration information is the
web.configfile, so we'd really like to specify the rewrites in there. Additionally, .Net has a fast regular expression processor, giving free and fast search and replace of URLs. Let's define a section in the
web.configfile which specifies those rewrites:
<configuration> <system.web> <urlrewrites> <rule> <url>/urlrewriter/show\.asp</url> <rewrite>show.aspx</rewrite> </rule> <rule> <url>/urlrewriter/wohs\.asp</url> <rewrite>show.aspx</rewrite> </rule> <rule> <url>/urlrewriter/show(.*)\.asp</url> <rewrite>show.aspx?$1</rewrite> </rule> <rule> <url>/urlrewriter/(.*)show\.html</url> <rewrite>show.aspx?id=$1&cat=2</rewrite> </rule> <rule> <url>/urlrewriter/s/h/o/w/(.*)\.html</url> <rewrite>/urlrewriter/show.aspx?id=$1</rewrite> </rule> </urlrewrites> </system.web> </configuration>Notice how we have to escape the period in the url element such as 'show\.asp'. This is a Regular Expression escape and it's a small price to pay for the flexibility of regular expressions. These also show how we set-up a capturing expression using (.*) in the <url> element and refer to that capture in the <rewrite> element with $1
Configuration Section Handlers
.Net's configuration mechanism requires us to write code as a "handler" for this section. Here's the code for that:<configuration> <configSections> <sectionGroup name="system.web"> <section name="urlrewrites" type="ThunderMain.URLRewriter.Rewriter, ThunderMain.URLRewriter, Version=1.0.783.30976, Culture=neutral, PublicKeyToken=7a95f6f4820c8dc3"/> </sectionGroup> </configSections> </configuration>
This section handler specifies that for every section called "urlrewrites", there is a class called
ThunderMain.URLRewriter.Rewriterwhich can be found in the ThunderMain.URLRewriter.dll assembly with the given public key token. The public key token is required because this assembly has to be placed into the GAC and therefore given a strong name.
A section handler is defined as a class which implements the IConfigurationSectionHandler interface. This has one method,
Create(), which should be implemented, and in our code that is very simple. It merely stores the urlrewrites element for later use:
public object Create(object parent, object configContext, XmlNode section) { _oRules=section; return this; }
Initiating the rewrite process
Coming back to actually rewriting the URL, as I said earlier, we need to do something in theApplication_BeginRequest()event in Global.asax - we just delegate this to another class:
protected void Application_BeginRequest(Object sender, EventArgs e){ ThunderMain.URLRewriter.Rewriter.Process(); }which calls the static method Process() on the Rewriter class. Process() first obtains a reference to the configuration section handler (which happens to be an instance of the current class) and then delegates most of the work to
GetSubstitution()- an instance method of this class.
public static void Process() { Rewriter oRewriter= (Rewriter)ConfigurationSettings.GetConfig("system.web/urlrewrites"); string zSubst=oRewriter.GetSubstitution(HttpContext.Current.Request.Path); if(zSubst.Length>0) { HttpContext.Current.RewritePath(zSubst); } }
GetSubstitution()is just as simple - iterating through all possible URL Rewrites to see if one matches. If it does, it returns the new URL, otherwise it just returns the original URL:
public string GetSubstitution(string zPath) { Regex oReg; foreach(XmlNode oNode in _oRules.SelectNodes("rule")) { oReg=new Regex(oNode.SelectSingleNode("url/text()").Value); Match oMatch=oReg.Match(zPath); if(oMatch.Success) { return oReg.Replace(zPath,oNode.SelectSingleNode("rewrite/text()").Value); } } return zPath; }
Installing the sample code
Extract the code into a URLRewriter folder, then turn this into a virtual directory using the Internet Information Services MMC control panel applet. Compile the code use the 'Make Rewriter.bat' batch script into the bin sub-folder. Then add bin/ThunderMain.URLRewriter.dll to the Global Assembly Cache by copying and pasting the dll into %WINDIR%\assembly using Windows Explorer. Finally, navigate to http://localhost/URLRewriter/default.aspx and try the demo URLs listed.None will actually work because there's one last thing we have to be aware of...
Finally
There's one major caveat with all this. If you want to process a request with a file extension other than .aspx such as .asp or .html, then you need to change IIS to pass all requests through to the ASP.NET ISAPI extension. Unfortunately, you will need physical access to the server to perform this, which prevents you from simply XCOPY deploying your code to an ISP.We've added the HEAD, GET and POST verbs to all files with .* file extension (ie all files) and mapped those to the ASP.NET ISAPI extension - aspnet_isapi.dll.
The complete range of mappings, including the new .* mapping.
相关文章推荐
- Url rewriter是个好东西,blog也使用了这个东西
- 引用自高手(http://www.cnblogs.com/coolalan/p/4263652.)---一篇如何使用csdn git的文章(作为一个混惯了svn的码农,这个真是改三观的好东西)!!!!
- 一篇關於Android layout研究的文章[轉 from:http://blog.csdn.net/archimedes_zht/article/details/5333120]
- 节日里研究了点东西,写了两篇文章——附送一个中文编程
- js中我使用hover事件当处于这个div上时显示,离开时慢慢消失,研究了很久,最后发现添加了return就可以了
- MyBatis: 获取一个用户的所有blog的基础上讲一下如何获取一篇文章机器作者的信息。
- mysql插入int字段可以使用单引号包含?今天改东西无意发现时间字段成int类型了,而且update这个字段用单引号包含,于是在MySQL上测试下能运行,求解
- 节日里研究了点东西,写了两篇文章——附送一个中文编程
- 文章来源:VRAR123网址导航_NEWS 瑞典Gleechi 企业里智能VR专家正在研究一个系统,这个系统主要是增强VR游戏的人机互动开发;也就是说游戏玩家可以通过平台提供的实时页面,通过自
- 大家好 这个事我的BLOG 站点 欢迎大家 訪问和公布文章技术的 和评论 交流技术使用
- 发现一个好玩的东西 blog地图
- 转载一篇nm命令使用的文章,虽然没用用这个方法解决但是文章很好
- 发现这个Blog的一个Bug
- 推荐一个网址,一个人的blog的一篇文章,里面列出了N多的.net下的测试工具和文章
- 现在正在搞博士的东西,导师要求将一个二维的图片转换成现实生活中的三维图片,实现仿真,这个课题真有点难度,现在还莫不着头绪,查了相关的一些知识,可就是没有一个系统的,希望大家各抒己见,提一些好的建议,好的文章
- 一篇公司管理的文章,很好:一个小公司老板的日常管理,希望能让创业的朋友学到东西(引用)
- 我将msdn上一篇关于数据库打包的文章由vb改为cs拉,好久没发言了,希望这个东西对大家有用!
- 初学设计模式(3)-----单例模式(在研究单例的线程安全问题时,发现一篇很全面的文章,直接转了)
- 今天在使用iscroll4 做一个简单触屏滚动demo,发现上下拖动的时候总是会回弹,不能看到下面的内容.这个问题苦恼了很久,终于解决
- 科学的发展不是靠这个或那个发现,也不是靠撰写或发表一篇论文,而是靠热忱的研究和大量的工作。